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TCL-lb GENE AND PROTEIN AND RELATED METHODS AND 
COMPOSITIONS 

This invention was made in part with government support under Grant 
5 numbers CA39880 and CAS 1083 awarded by the National Institutes of Health. 
The government has certain rights in the invention. 

CROSS REFERENCE TO RELATED APPLICATIONS 

10 

This application claims priority, in part, under 35 USC §119 based upon 
U.S. Provisional Patent Application No. 60/124,714 filed March 15, 1999. 

15 FIELD OF THE INVENTION 

The present invention relates to the field of molecular biology, more 
particularly to the isolation and characterization of a third member of the TCL1 gene 
family, specifically TCL-lb, also activated by chromosomal rearrangements in T cell 
20 leukemias. 

BACKGROUND OF THE INVENTION 

25 There is a close association between particular chromosomal abnormalities, 

e.g., chromosomal translocations, inversions, and deletions, and certain types of 
malignancy indicating that such abnormalities may have a causative role in the 
cancer process. Chromosomal abnormalities may lead to gene fusion resulting in 
chimeric oncoproteins, such as is observed in the majority of the tumors involving 

30 the myeloid lineage. Alternatively, chromosomal abnormalities may lead to 
deregulation of protooncogenes by their juxtaposition to a regulatory element active 
in the hematopoietic cells, such as is observed in the translocation occurring in the 
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lymphocytic lineage (Virgilio et aL, 1993, Proc. Natl. Acad. Sci. USA, 
90:9275-9279). 

Non random chromosomal translocations are characteristic of most human 
hematopoietic malignancies (Haluska et aL, 1987, Ann. Rev. Genet, 21:321-345) 
5 and may be involved in some solid tumors (Croce, 1987, Cell, 49:155-156). In B 
and T cells, chromosomal translocations and inversions often occur as a 
consequence of mistakes during the normal process of recombination of the genes 
for immunoglobulins (Ig) or T-cell receptors (TCR). These rearrangements 
juxtapose enhancer elements of the Ig or TCR genes to oncogenes whose expression 
10 is then deregulated (Croce, 1987, Cell, 41:155-156). In the majority of the cases, 
the rearrangements observed in lymphoid malignancies occur between two different 
chromosomes. 

The TCL-1 locus on chromosome 14 band q32.1 is frequently involved in the 
Z: chromosomal translocations and inversions with the T-cell receptor genes observed 

0* 15 in several post-thymic types of T-cell leukemias and lymphomas, including 

m T-prolymphocytic leukemias (T-PLL) (Brito-Babapulle and Catovsky, 1991, Cancer 

^ Genet. Cytogenet , 55:1-9), acute and chronic leukemias associated with the 

O immunodeficiency syndrome ataxia- telangiectasia (AT) (Russo et aL, 1988, Cell, 

y, 53:137-144; Russo et aL, 1989, Proc. Natl. Acad. Sci. USA, 86:602-606), and 

Jf | 20 adult T-cell leukemia (Virgilio et aL, 1993, Proc. Natl. Acad. Sci. USA, 

O 90:9275-9279). 

The TCL1 oncogene on chromosome 14q32.1 is also involved in the 
development of chronic T-cell leukemia in humans (T-CLL) and is activated in these 
leukemias by juxtaposition to the T-cell receptor a/5 locus, caused by 
25 chromosomal translocations, t(14;14)(qll;32), t(7;14)(q35;q32), or inversions 
inv(14)(qll;q32). Normally TCL1 expression is observed in early T-cell 
progenitors (CD4 CD8 CD3 ) and lymphoid cells of the B-cell lineage: pre B-cells 
and immature IgM expressing B-cells. Introduction of a TCL1 trans gene under the 
control of a Ick promoter caused mature T-cell leukemia in mice. (Virgilio et aL, 
30 1998, Proc Natl Acad Sci USA, 95:3885-3889). 

However, some cases of T-cell malignancies with abnormalities such as 
gene amplification at 14q32.1 did not show activation of the TCL1 expression, 
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suggesting that perhaps an additional oncogene may be located in 14q32.1. The 
second member of the TCL1 gene family, MTCP1, is located at Xq28 and 
activated in rare cases of mature T-cell leukemia with a t(X;14)(q28;qll) 
translocation. The present invention involves the isolation and characterization of 
the third member of the TCL1 gene family, TCLlb, located at 14q32.1 and also 
activated by rearrangements at 14q32.1 in T-cell leukemias. 

Rearrangements of the TCL-1 locus at chromosome 14q32.1 are unique, in 
that the other locus involved in these rearrangements, namely the TCR a/8 locus, is 
also on chromosome 14 at subband qll (Croce et aL, 1985, Science 
227:1044-1047; Isobe et aL, 1988, Proc Natl Acad Sci USA, 85:3933-3937). For 
this reason, the rearrangements observed cytogenetically are either chromosomal 
inversions, inv(14) (qll;q32), involving only one of the chromosomes 14 or 
translocations involving both, chromosomes 14 such as the t(14;14) (qll;q32), or 

more rarely, the t(7:14) (q35;q32) involving the TCR p locus at 7q35 (Isobe et aL, 
1988, Proc Natl Acad Sci USA, 85:3933-3937) . Several of the breakpoints at 
14q32.1 involved in these translocations have been cloned and characterized (Russo 
et al., 1988, Cell, 51:137-144; Baer, et al., 1987, Proc Natl Acad Sci, 
84:9069-9073; Mengle-Gaw et al., 1987, EMBO 1:2273-2280; Bertness et aL, 

1990, Cancer Genet Cy to genet , 44 : 47-54) . 

The TCL-1 locus, a chromosomal region of approximately 350 kb as 
determined by placement of translocation breakpoints on the long range genomic 
map, has recently been cloned (Virgilio, et aL, 1993, Proc Natl Acad Sci USA, 
90:9275-9279). The involvement of such a large region in translocation events 
suggests that activation of the putative TCL-1 gene may occur from a distance of 
many kilobases, as previously observed for the BCL—1/CCNDI gene in mantle cell 
lymphoma (Tsujimoto, et aL, 1984, Science 22,4:1403-1406; Rosenberg, et aL, 

1991, Proc Natl Acad Sci USA, 88:9638-9642; Withers, et aL, 1991, Mol Cell 
Biol, 11:4846-4853; Motokura and Arnold, 1993, Genes Chrom & Cancer, 
7:89-95) and the MYC oncogene in Burkitt lymphoma (Dalla-Favera, et aL, 1982, 
Proc Natl Acad Sci USA, 79:7824-7827; Nishikura, et aL, 1983, Proc Natl Acad 
Sci USA, 80:4822-4826) and in acute T-cell leukemia (Erikson, et aL, 1986, 
Science, 232:884-886). 



PATENT CRO01 .NP003 

Introduction of a TCL1 transgene under the control of the T-cell specific lck 
promoter into mice causes T-cell proliferative disorder and, at the age of 15 months, 
T-cell leukemia (Virgilio, L., et al., 1998, Proc Natl Acad Sci USA, 95:3885-3889). 
Another member of the TCL1 gene family is the MTCP1 gene on chromosome 
5 Xq28. MTCP1 is also activated in rare cases of T-cell leukemia by a 
t(X;14)(q28;qll) translocation (Soulier, J., et al., 1994, Oncogene, 9:3565-3570). 
In rare cases of mature T-cell leukemias with chromosomal abnormalities at 
14q32.1, activation of the TCL1 gene was not observed (Sakashita, K., et al., 1998, 
Leukemia, 12:970-971; Takizawa, J., et al., 1998, Jpn J Cancer Res, 89, 712-718). 

10 A second putative oncogene in this region was isolated, as described below, the 
TCLlb gene. This gene is located approximately 16 kb centromeric to TCL1 and 
shares 60% amino acid sequence similarity with TCL1. 

The expression profiles of both genes are very similar. TCL1 and TCLlb are 
expressed at very low levels in normal bone marrow and peripheral blood 

15 lymphocytes (Virgilio, L., et al., 1994, Proc Natl Acad Sci USA, 91:12530-12534; 
Pekarsky, Y., et al., 1999, Proc Natl Acad Sci USA, 96:2949-2951), but at higher 
levels in T-cell lines containing rearrangements of the 14q32.1 region (Virgilio, L., 
etal., 1994, Proc Natl Acad Sci USA, 91:12530-12534; Pekarsky, Y., et al., 1999, 
Proc Natl Acad Sci USA, 96:2949-2951). Since genes in close proximity to TCL1 

20 and TCLlb may also be activated in leukemias with rearrangements at 14q32.1, the 
chromosomal region bracketed by two previously published breakpoint cluster 
regions observed in T-cell neoplasias (Virgilio, L., et al., 1994, Proc Natl Acad Sci 
USA, 91:12530-12534; Virgilio, L., et al., 1993, Proc Natl Acad Sci USA, 
90:9275-9279) at 14q32.1was investigated for the presence of additional genes. 

25 The murine Tell locus was also examined in order to investigate the function 

of TCL1 and TCLlb, In the mouse the syntenic region of human chromosome 14q32 
is the region of the murine chromosome 12 proximal to the immunoglobulin locus. 
The murine Tell protein shows a 50% homology to the human Tell (Narducci, 
M.G., et al., 1997, Oncogene, 15:919-926) and is expressed in fetal hematopoietic 

30 organs and in immature T and B-cells as well as in adult spleen and thymus 

(Narducci, M.G., et al., 1997, Oncogene, 15:919-926). In order to identify other 
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members of the murine Tell family the murine Tell locus was also investigated for 
the presence of homologous genes. 

There remains an unfulfilled need to fully isolate and characterize the other 
member of the TCL-1 gene family, TCLlb, and the genes located very closely to 
5 TCLlb and TCL1; TNG1 and TNG2. The identification of additional oncogenes that 
are associated with chromosomal abnormalities causing T-cell leukemias and 
lymphomas further expands the efficacy by which a diagnostic and 
therapeutic/prophy lactic reagent will detect, treat, and prevent such disease states. 
The present invention fulfills this need by the identification and characterization of 
10 the TCLlb, TNG1 and TNG2 genes. 

Citation of references herein above shall not be construed as an admission 
that such references are prior art to the present invention. 



15 SUMMARY OF THE INVENTION 

The TCL1 gene family is implicated in the development of T-cell 
malignancies. The present invention discloses the identification and characterization 
of new members of this gene family, the TCL-lb, TNG1 and TNG2 genes. The 

20 present invention relates to the nucleotide sequences of TCLlb, TNG1 and TNG2, 
and amino acid sequences of their encoded Tel lb, Tngl, and Tng2 proteins, 
respectively, as well as derivatives and analogs thereof, and antibodies thereto. The 
present invention further relates to nucleic acids hybridizable to or complementary 
to the foregoing nucleotide sequences, as well as equivalent nucleic acid sequences 

25 encoding a Tcllb, Tngl or Tng2 protein. 

The present invention relates to expression vectors encoding a Tcllb, Tngl 
or Tng2 protein, derivative or analog thereof, as well as host cells containing the 
expression vectors encoding the Tcllb, Tngl or Tng2 protein, derivative or analog 
thereof. 

30 The present invention further relates to the use of TCLlb f TNG1 and TNG2 

genes and their encoded proteins as diagnostic and therapeutic tools for the detection 
and treatment of disease states associated with chromosomal abnormalities, 
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specifically abnormalities at 14q32.L In one embodiment of the present invention 
the use of nucleotide sequences of TCL-lb, TNG1 or TNG2 genes and amino acid 
sequences of their encoded Tcl-lb, Tngl, or Tng2 proteins, respectively, are used 
as diagnostic reagents or in the preparation of diagnostic agents useful in the 
detection of disease states, such as T-cell leukemias and lymphomas, associated with 
chromosomal abnormalities, in particular at 14q32.1, and/or increased levels of 
expression of the Tel lb, Tngl or Tng2 protein. 

The invention further relates to the use of nucleotide sequences of TCL-lb, 
TNG1 or TNG2 genes and amino acid sequences of their encoded Tel lb, Tngl or 
Tng2 protein, respectively, as therapeutic/prophylactic agents in the 
treatment/prevention of disease states, such as T-cell leukemias, associated with 
chromosomal abnormalities, in particular at 14q32.1, and/or increased levels of 
expression of the Tel lb, Tngl or Tng2 protein. 

The TCL-lb, TNG1 or TNG2 genes and Tel lb, Tngl or Tng2 protein 
sequences disclosed herein, and antibodies thereto, are used in assays to diagnose T- 
cell leukemias and lymphomas associated with chromosomal abnormalities, and/or 
increased expression of Tcllb, Tngl or Tng2 protein. 

The Tcllb, Tngl or Tng2 protein, or derivatives or analogs thereof, 
disclosed herein, are used for the production of anti-Tcllb, anti-Tngl or anti-Tng2 
antibodies, respectively, which antibodies are useful diagnostically in immunoassays 
for the detection or measurement of Tcllb, Tngl or Tng2 protein, respectively, in a 
patient sample. 

Another aspect of the present invention relates to methods of treatment of 
diseases or conditions associated with chromosomal abnormalities and/or increased 
expression of Tcllb, Tngl or Tng2 proteins. Abnormalities of chromosome 14, 
such as inversions and translocations, particularly at 14q32.1, are associated with T- 
cell leukemias and lymphomas. TCL-lb, TNG1 or TNG2 gene sequences and their 
protein products are used therapeutically in the treatment of disease states associated 
with chromosome 14 abnormalities. Anti-Tcllb, anti-Tngl or anti-Tng2 antibodies 
are used therapeutically, for example, in neutralizing the activity of an 
overexpressed Tcllb, Tngl or Tng2 protein, respectively, associated with disease. 
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Oligonucleotide sequences, including antisense RNA and DNA molecules 
and ribozymes, designed to inhibit the transcription or translation of TCL-lb, TNG1 
or TNG2 mRNA, are used therapeutically in the treatment of disease states 
associated with increased expression of Tel lb, Tngl or Tng2, respectively. 
5 Proteins, peptides and organic molecules capable of modulating activity of 

Tel lb, Tngl or Tng2 are used therapeutically in the treatment of disease states 
associated with aberrant expression of Tcllb, Tngl or Tng2. 

The present invention further relates to therapeutic compositions comprising 
Tcllb, Tngl or Tng2 proteins, derivatives or analogs thereof, antibodies thereto, 
10 nucleic acids encoding the Tcllb, Tngl or Tng2 proteins, derivatives or analogs, 
and TCL-lb, TNG1 or TNG2 antisense nucleic acid. 

The present invention further relates to methods of production of the Tcllb, 
Tngl or Tng2 proteins, derivatives and analogs, such as, for example, by 
recombinant means. 

15 

DESCRIPTION OF THE DRAWINGS 

Figure 1. Sequence comparison of Tell, Tel- lb and Mtcpl. Identities are shown in 
20 black boxes, similarities are shown in shaded boxes. For Tell and Mctp GenBank 
accession numbers are X82240 and Z24459, respectively. 

Figure 2 Figure 2. Genomic organization of the TCL1 and TCLlb genes. Vertical 
arrows refer to cloned 14q32.1 breakpoints. Restriction sites are given for BssHII 
25 (B), Clal (C), EagI (E), Sfil (F), Kspl (K), Mlul (M), NotI (N), Nrul (R) and Sail 
(S). Solid boxes represent TCL1 and TCLlb exons. 

Figure 3. Northern analysis of the TCL1 and TCLlb genes. (A). Human immune 
system Northern blot. Lanes 1-6: spleen; lymph node; thymus; peripheral blood 
30 leukocyte; bone marrow; fetal liver. (B). Human cancer cell line Northern blot. 
Lanes 1-8: promyelocytic leukemia, HL-60; Hela cells; chronic myelogenous 
leukemia, K-562; T-lymphoblastic leukemia, MOLT-4; Burkitt's lymphoma Raji; 
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colorectal adenocarcinoma, SW480; lung carcinoma, A549; melanoma, G361. (C). 
Lanes 1-6: Burkitt's lymphoma Raji; Burkitt's lymphoma Daudi; Burkitt's 
lymphoma CA-46; SupTll; bone marrow; placenta. (D). Lane 1: bone marrow; 
lanes 2-7, EBV transformed lymphoblastiod cell lines: Ado-1471; Ado-1476; Ado- 
5 1701; Ado-1727; Ado-2069; Ado-2199; lane 8: CA-46. (A-D). Top, TCLlb probe; 
middle, Tell probe; bottom, actin probe. 

Figure 4. RT-PCR analysis of the TCL1 and TCLlb genes. (A). Normal human 
tissues. Lanes 1-23: heart; liver; brain; muscle; placenta; kidney; lung; pancreas; 
spleen; lymph node; thymus; tonsil; peripheral blood lymphocytes (PBL); fetal 
liver; fetal brain; fetal lung; fetal kidney; fetal heart; fetal skeletal muscle; fetal 
spleen; fetal thymus; negative control. (B) Lanes 1-4, T cell PLL samples: 3047; 
3046; 3050; 3048. Lanes 5-6: bone marrow; PBL. (A-B). Top, TCLlb primers; 
middle, TCL1 primers; bottom, control G3PDH primers. 

Figure 5.: Genomic organization of human and mouse TCL1 loci. (A) Human TCL1 
locus. Vertical arrows refer to cloned 14q32.1 breakpoints (1, 7). Restriction sites 
are given for BssHII (B), Clal (C), EagI (E), Sfil (F), Kspl (K), Mlul (M), NotI 
(N), and Sail (S). Solid boxes represent exons of the four genes. (B) Striped boxes 
indicate translated parts of exons, white boxes indicate untranslated regions. Bold 
lines under the exons show various splicing products of TNG1, TNG2, and TCLlb 
genes. (C) Murine Tell locus. Restriction sites and exons are indicated as in (A). 

Figure 6. RT-PCR analysis of TNG1 and TNG2 genes. (A) Leukemia cell lines. 
25 Lanes 1-3: T-ALL cell lines: MOLT3; MOLT4; CEM. Lane 4: pre B-ALL cell line 
697. Lane 5: T-ALL cell line SupTll. Lane 6-8: Burkitt's lymphoma cell lines CA- 
46; Raji; Daudi. Lanes 9-10: bone marrow; peripheral blood lymphocytes (PBL). 
First panel, TCL1 primers; second panel, TCLlb primers; third panel, TNG1 
primers; fourth panel, TNG2 primers; bottom, control G3PDH primers (B). Normal 
30 human tissues. Lanes 1-23: heart; liver; brain; muscle; placenta; kidney; lung; 
pancreas; spleen; lymph node; thymus; tonsil; PBL; fetal liver; fetal brain; fetal 
lung; fetal kidney; fetal heart; fetal skeletal muscle; fetal spleen; fetal thymus; 
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negative control. (C) Lanes 1-4, T cell PLL samples: 3047; 3046; 3050; 3048. 
Lanes 5-6: bone marrow; PBL. (B-C). Top, TNG1 primers; middle, TNG2 primers; 
bottom, control G3PDH primers. 

5 Figure 7. Northern analysis of TNG1 and TNG2 genes. Lanes 1-3: Burkitt's 
lymphomas Raji; Daudi; CA-46; Lane 4: T-ALL SupTll; Lanes 5-6: bone marrow; 
placenta. Top, TNG1 probe; middle, TNG2 probe; bottom, actin probe. Each lane 
contains 3 jag of poly A + RNA. 

10 Figure 8, RT-PCR analysis of murine Tcllb genes. (A-B) Nested PCR, except (3- 
actin. The panels are in the same order. (A) Normal mouse tissues. Lanes 1-13: 
heart; brain; spleen; lung; liver; sceletal muscle; kidney; testis; 7-day embryo; 11- 
day embryo; 15-day embryo; 17-day embryo; negative control. (B) Lymphoid cell 
lines. Lanes 1-5: B-cell lines NFS-5; NFS-70; WEHI-279; MOPC-31C; MPC-11. 

15 Lanes 6-7: T-cell lines S49.1; BW5147. Lane 8-9: ES cells; negative control. (C) 
Single round of RCR. Lanes 1-4: ES cells, mouse oocytes; 2-cell embryos; negative 
control. 

Figure 9. Sequence comparison of human and murine Tell, Tcllb and Mtcpl 
20 proteins. Identities are shown in black boxes, similarities are indicated by shaded 
boxes. * mark the conserved residues of the inner hydrophobic core. 

Figure 10. Location of the insertion in human and murine Tcllb proteins: A side 
view of human Tell is shown in green. The Tel- lb insert into the C-D loop is 
25 shown in blue. 

DESCRIPTION OF THE INVENTION 

30 Methods 
Cell lines. 
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Cell lines, except EBV transformed lymphoblastoid cell lines, were obtained 
from ATCC (Rockville, MD) and grown in RPMI media with 10% fetal bovine 
serum. Lymphoblastoid cell lines were made from peripheral blood lymphocytes of 
patients with Alzheimer's disease by transformation with Epstein-Barr virus (EBV) 
5 as previously reported (Ounanian, A., et aL, 1992, Mech Ageing Dev, 63: 105-116). 

Human leukemia cell lines MOLT 3, MOLT 4, CEM, and SupTll (T-cell 
leukemias) and 697 (pre B-cell leukemia) and CA-46, Raji, and Daudi (Burkitt's 
lymphomas) were obtained from American Type Culture Collection (Manassas, 
VA.) Mouse lymphatic cell lines NFS-70 C-10 (pro B-cells), NFS-5 C-l and WEHI- 
10 279 (pre B-cells), MOPC-31C and MPC-11 (plasma cells), and S49.1 and BW 5147 
(thymocytes) were also purchased from American Type Culture Collection 
(Manassas, VA. ). All cell lines were grown in RPMI 1640 medium with 10% fetal 
bovine serum. 



15 Northern, Rapid Amplification of cDNA Ends (RACE) and Reverse Trans cripton- 
PCR (RT-PCR ) analysis. 

These experiments were carried out as previously described (Pekarsky, Y, et 
aL, 1998, Proc Natl Acad Sci USA, 95:8744-8749) with the following exceptions. 
Human bone marrow and placenta mRNAs, human immune system and human 

20 cancer cell line Northern blots were purchased from Clontech (Pato Alto, CA). 
Each line on Figure 3C and D contains 3 mg Poly A + RNA. PCR shown on 
Figure. 4A was carried out for 25-35 cycles using Multiple Tissue cDNA Panels 
(Clontech) and manufacturer's protocol. Primers were: top panel, TCI 
GGC AGCTCT ACCCCGGGATGA A , (SEQ. ID. NO: 1); and TC39 

25 ACAGACCTGAGTGGGACAGGA, (SEQ. ID. NO: 2); middle panel, TCLB 
TCCTCCTTGGC AGGAGTGGT A , (SEQ. ID. NO: 3); and TCLC 
C AGTT ACGGGTGCTCTTGCGT , (SEQ. ID. NO.: 4); lower panel, control 3' and 
5' RACE G3PDH primers (Clontech). Figure 4B, middle and bottom panels, 
primers were the same as above. Figure 4B, top panel. PCR was carried out for 22 

30 cycles with primers TC8 ATGGCCTCCGAAGCTTCTGTG, (SEQ. ID. NO: 5); 
and TC39. 0.1 ml of the reaction was used for the second PCR with nested primers 
TC10 TGGTCGTGCGGTTCAATCCCT, (SEQ. ID. NO: 6); and TC5 
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AATCTGGCC ATGGTCTGCTATTTC , (SEQ. ID. NO: 7); for 15 cycles. RACE 
primers were: TCI (for 3' RACE) and TC5 (for 5' RACE). 

Mouse and human tissue cDNAs for RT-PCR and RACE experiments were 
purchased from Clontech (Palo Alto, CA). Mouse egg and 2 cell embryo cDNA 
5 libraries for embryonic expression studies in mouse were previously described 
(Rothstein, J.L., et al., 1992, Genes Dev, 6:1190-1201). The DNAs from these 
libraries were diluted to the same concentration of cDNA as in mouse tissue 
samples. RNA extractions and reverse transcriptions from human and mouse cell 
lines and mouse embyonic stem cells were performed using Trizol™ reagent (Gibco 

10 BRL, Grand Island, NY). 2 jag of total RNA were transcribed into cDNA in a total 
volume of 20 ul using Superscript™ reverse transcription kit (Gibco BRL, Grand 
Island, NY) according to the manufacturers instructions. 1 ul of this reaction was 
used for PCR. RT-PCR for TNG1 was carried out with primers 1A 
TGC ATCCCTCC AGCC A AGGAT , (SEQ. ID. NO: 8); and 4A 

15 TGGCCTGC AGAGGCTCTC A AG , (SEQ. ID. NO: 9); for 25-35 cycles. For 
TNG2 primers 3B GTGCCTGTCTCATTCGCCTCTG, (SEQ. ID. NO: 10); and 
8B AGTGGGC ACATGTTAC AGC ATTC , (SEQ. ID. NO: 11); were used for the 
first round of 25 cycles and primers 4B GCATCCAGGACTGTGCCAGCA, (SEQ. 
ID. NO: 12); and 9B TTCTGTTAGCCTTGCTGTCCGT, (SEQ. ID. NO: 13); 

20 were used to amplify 0.1 ul of the first reaction in a nested PCR of 20 cycles. PCR 
conditions were 94°C denaturation for 30 sec, 54 to 62°C annealing for 30 sec and 
72°C extension for 30 sec. TCL1, TCLlb and, as control, G3PDH were amplified as 
described previously (6). RACE analysis was carried out in a nested reaction with 
30 cycles in the first round and 25 in the second. The primers were: TNG1: 1A and 

25 2A TTGAACCCAGGTCTCGTCTGAC, nested, (SEQ. ID. NO: 14); for 3' 
RACE and 3A AACGTAGGATGTGCACAGAGCA, (SEQ. ID. NO: 15); and 4A 
(nested) for 5' RACE and TNG2: 3B and 4B (nested) for 3' RACE and 8B and 9B 
(nested) for 5' RACE together with primers API and AP2 supplied by Clontech 
(Palo Alto, CA) fitting to the adapters of the cDNA. The murine Tcllb genes were 

30 amplified using the respective R reverse: 1R: GAGAACGGTCAGGACCCAAACC, 
(SEQ. ID. NO: 16);; 2R: C AGGCT ATCAAGACCTTTACTC , (SEQ. ID. NO: 
17);; 3/5R: TCAACCTCGCATATTACTATGTC, (SEQ. ID. NO: 18); 4R: 
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CAAAGGCACAAAGTGAGCAAGAG, (SEQ. ID. NO: 19); and F forward: IF 
A ATGTGGA A ACTTCTC ACTC AT , (SEQ. ID. NO: 20);; 2F 
ACTGGAAACTTGTTCTC ATTCAC , (SEQ. ID. NO: 21); 3/5F 
C ACTTGC AGC ATATGACC AC A AT , (SEQ. ID. NO: 22); 4F 
CCTGGTCTGC ACAAGAGATGA , (SEQ. ID. NO: 23); primers for 28 cycles 
Subsequently the respective R and FN forward nested: 1FN 
CTGTCCACTTGTGGAAGTTAAT, (SEQ. ID. NO: 24); 2FN 
C ACTTGTGGCAGATGACCAGATA , (SEQ. ID. NO: 25); 3/5FN 
CCAGGAGCCTACTCCCCAGCAG, (SEQ. ID. NO: 26); 4FN 
GTGGCAGATGACCACACTCTT, (SEQ. ID. NO: 27); primers were used in a 
seminested PCR for 25 cycles to amplify lul of the first reaction. PCR conditions 
were the same as described for human tissues. Due to the similarity of mouse Tcllb 
genes it was difficult to find specific primers for each of them. Subsequently Tcllb3 
and TcllbS were amplified with the same primers and sequenced to verify the 
expressed gene. However, in the case of embryonic tissue, unique forward primers 
were used to analyze the expression of TcllbS and TcllbS separately. The 
expression of both alternative first exons of Tcllb3 was verified using the primers 
3F horn homologous exon 1: CATTACTATGGCTGATTCAGTTC, (SEQ. ID. NO: 
28); and 3F alt alternative exon 1: GGAATGAGACTCTC AGGGC AC , (SEQ. ID. 
NO: 29); instead of 3/5F. RT-PCR for Tell was carried out similarly with 
primers TcllR, CCTGGGCAAGGCAGACAGGAGC, (SEQ. ID. NO: 30); and 
TCL1F, TGCTTCTTGCTCTTATCGGATG, (SEQ. ID. NO: 31); followed by a 
nested PCR using primers TcllRN, TTC ATCGTTGGACTCCGAGTC , (SEQ. ID. 
NO: 32); and TcllFN, AATTCC AGGTGATCTTGCGCC , (SEQ. ID. NO: 33);. 
The quality of the cDNA was verified by 25 cycles of p-actin RT-PCR using 
primers actR, GTACCACCAGACAGCACTGTG, (SEQ. ID. NO: 34); and actF, 
GACCCAGATCATGTTTGAGACC, (SEQ. ID. NO: 35); RACE analysis from 
mouse tissues was performed as described above for human tissues. The specific 
primers were: allR, AAGCCATCTATAAGGTCAGG, (SEQ. ID. NO: 36); for 
the first step and the respective R primers for the nested step of 5 ' RACE and the 
respective F (first) and FN (nested) primers for 3' RACE. 
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Pulsed-Field Gel Electrophoresis (PFGE) analysis and chromosomal localization. 

PFGE analysis was performed as described (Pekarsky, Y., et al., 1998, Proc 
Natl Acad Sci USA, 95:8744-8749), except pulse time was 1-6 second for 11 hours. 
Chromosomal localization of the TCLlb gene was carried out using GeneBridge 4 
5 radiation hybrid mapping panel (Research Genetics, Hunts ville, AL) according to 
the manufacturer's protocol. Primers were TCI and TC4, 
TGCT AGGACC AGCTGCTCC AT AGA , (SEQ. ID. NO: 37). 

Sequencing. 

10 Products from RACE and RT-PCR experiments were cut and extracted from 

agarose gels using a QIAquick gel extraction kit (Qiagen, Valencia, CA) according 
to the manufacturer's instructions. Subsequently they were sequenced using an 
automated sequencer model 377 (Perkin Elmer, Foster City, CA). A human 
Bacterial Artificial Chromosome library (BAC) (277A8) was partially digested with 

15 Sau3A and TSP509I and cloned into a pUC18 vector using standard methods. 100 
random clones were isolated and sequenced from both ends using a 377 automated 
sequencer. The DNA sequences were compared to the expressed sequence tag (EST) 
database. The mouse BAC 452-124 was sequenced and analyzed as described 
previously (Inoue, H., etal., 1997, Proc Natl Acad Sci USA, 94:14584-14589). EST 

20 clones were purchased from Research Genetics (Huntsville, AL) and sequenced. 

Northern blot and pulse-field gel electrophoresis (PFGE) for TNG gene 

Total RNA for Northern blot experiments was isolated as described above. 
PolyA+ RNA isolation, Northern blotting and hybridization was performed as 
25 previously described (Hallas, C, et al., 1999, Clin Cancer Res, In press). TNG1 
and TNG2 probes were generated by RT-PCR. PFGE analysis was performed as 
described 7 using BAC (277A8) DNA and TNG1, TNG2, TCL1, and TCLlb probes. 

Protein structure. 

30 A computer model was created for the human and murine Tel lb proteins 

based on their similarity to Tell. The atomic coordinates for human TCL1 are 
derived from the crystal structure (Hoh, F., et al., 1998, Structure, 6:147-155). 
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The initial sequence alignment was generated by maximizing the correlation between 
the sequences. Modeling and analysis were done using Insightll (Biosym, San 
Diego, Ca.). 

5 

Results 

Identification of the TCLlb gene. 

In some mature T-cell leukemias with chromosomal abnormalities at 

10 14q32.1, activation of the TCL1 gene at 14q32.1 was not observed (Takizawa, J., et 
aL, 1998, Jpn J Cancer Res, 89:712-718; Sakashita, et al., 1998, Leukemia, 
12:970-971). To investigate the possibility that other, unknown TCL1 family 
member(s) may be involved, we searched the EST database for sequences 
homologous to the TCL1 and MTCP1 gene products. A single EST (accession 

15 number AA689513) was found to be homologous, but not an exact match to both 
genes. Thus, a ~ 1.2 kb full length cDNA (SEQ. ID. NO: 38) was isolated using 5' 
and 3' RACE procedure and human testis mRNA as a cDNA source. The 1.2 kb 
TCLlb cDNA encodes a 14 kDa protein of 128 amino acids (SEQ. ID. NO: 39) 
(Figure 1). It contains a starting ATG codon at position 28 within a perfect Kozak 

20 consensus sequence. The Tcllb protein has a 14 amino acid insertion compared to 
the Tell and Mtcpl proteins (Figure 1); it is 30% identical and 60% similar to 
Tell, and 36% identical and 63% similar to Mtcpl (Figure 1). 

A radiation hybrid mapping panel (GeneBridge 4) was used to determine the 
chromosomal localization of the human TCLlb gene. By analysis of PCR data at the 

25 MIT database (http://www-genome.wi.mit.edu), the TCLlb gene was localized to 
3.05 cR from the marker D14S265, at 14q32. A TCLlb pseudogene and localized it 
to 5ql2-5ql3. The TCLlb pseudogene does not have the initiating ATG or introns 
and has a stop codon in the middle of the open reading frame. 

TCL1 and TCLlb are both located at 14q32, therefore, a determination was 

30 made as to whether TCL1 and TCLlb are physically linked. The human bacterial 
artificial chromosome (BAC) library and found several BAC clones containing TCL1 
and TCLlb. The TCLlb gene (SEQ. ID. NO: 40) is 6.5 kb in size and contains 4 
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exons of 189, 171, 69 and 697 bp respectively (Figure 2), but only the first three 
exons are coding. Pulsed field analysis of the positive BAC clone with both probes 
revealed that the TCL1 and TCLlb genes have opposite directions of transcription 
and are separated only by 16 kb (Figure 2). Both genes are located in the — 160 kb 
5 region between previously published two sets of breakpoints observed in T-cell 
acute lymphoblastic leukemia (ALL) cases with translocations or inversions at 
14q32.1 (Virgilio, L., et aL, 1994, Proc Natl Acad Sci USA, 91:12530-12534; 
Virgilio, L., etal., 1993, Proc Natl Acad Sci USA, 90:9275-9279). 

10 Expression of TCLlb gene and its activation in T-cell malignancies. 

Because of the similarities between the TCL1 and TCLlb genes in their 
structure, sequence, and location, it seemed possible that they would exhibit similar 
expression patterns. To verify this, we carried out a series of Northern and RT-PCR 
experiments (Figures 3 and 4). Northern analysis in normal tissues was mostly 

15 negative for TCLlb (Figure 3A), except that the 1.2 kb transcript was detected after 
several days exposure in testis and placenta (Figure 3C). The TCL1 gene 
expression, however, was detected in most hematopoietic tissues after several days 
exposure (Figure 3A). Semiquantitative RT-PCR analysis (Figure 4A) revealed 
that both TCL1 and TCLlb genes are expressed in spleen, tonsil, fetal liver, fetal 

20 kidney, and fetal thymus. However, the TCLlb gene is expressed in wider variety of 
tissues including placenta, kidney and fetal spleen (Figure 4A). Northern analysis of 
commercial human cancer cell lines showed that TCL1 and TCLlb are expressed in 
only the Raji Burkitt lymphoma cell line (Figure 3B), although TCL1 was expressed 
at a much higher level (Figure 3B). 

25 The TCL1 and TCLlb genes have similar transcription patterns and are 

physically linked. Therefore, a determination as whether the TCLlb gene could also 
be activated by rearrangements in 14q32 was made. Figures 3C and 3D show the 
activation of the TCLlb gene in a T-leukemia cell line with a translocation at 
14q32.1 (SupTll) compared with the normal bone marrow and with EBV 

30 transformed lymphoblastiod B cell lines expressing TCLL (Figures 3C and 3D, 
middle panels). Since TCL1 and TCLlb are normally not expressed in post-thymic 
T-cells and post-thymic T-cell leukemias lacking 14q32.1 abnormalities (for 
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example, in T-ALL MOLT4 with no abnormalities at 14q32.1, Fig. 3B, lane 4), the 
expression of TCL1 and TCLlb in SupTll cells carrying a t(14;14)(qll;q32,l) 
translocation indicates that juxtaposition of TCL1 and TCLlb to the a/8 locus of the 
T- cell receptor deregulates both genes. 
5 To further investigate TCLlb expression, four T-cell leukemias and six EBV 

transformed lymphoblastoid cell lines with elevated levels of TCL1 were analyzed. 
Figure 4B shows the activation of the TCLlb expression in one leukemic sample 
from a patient with T-cell prolymphocycic leukemia. Human T-cell prolymphocytic 
leukemias carry the 14q32.1 translocation or inversion and overexpress TCL1 
10 (Virgilio, L., et al., 1994, Proc Natl Acad Sci USA, 91:12530-12534; Narducci, 
M.G., et al., 1997, Cancer Res, 57:5452-5456). The TCLlb gene was also 
expressed in two out of six EBV transformed lymphoblastoid B cell lines (Figure 
3D, upper panel, lanes 2-7). 

15 The human TCL1 locus. 

The TCL1 and TCLlb genes are both located on chromosome 14q32.1 within 
a — 160 kb region between two previously published breakpoint cluster regions 
observed in T-cell neoplasms (Virgilio, L., et al., 1994, Proc Natl Acad Sci USA, 
91:12530-12534; Virgilio, L., et al., 1993, Proc Natl Acad Sci USA, 90:9275- 

20 9279). Both genes are activated by translocations and inversions involving 14q32.1 
(Pekarsky, Y., et al., 1999, Proc Natl Acad Sci USA, 96:2949-2951). To investigate 
whether other, unknown genes within this region are also activated by the same 
rearrangements a previously isolated bacterial artificial chromosome library (BAC) 
of 110 kb (277A8, ref. 6) covering the majority of this region was analyzed. This 

25 BAC was partially digested with the restriction enzymes Sau3A and TSP509I and 
cloned into a pUC18 vector. 100 clones (the equivalent of the length of the BAC) 
were picked randomly and sequenced from both sides. These sequences were 
compared to the expressed sequence tag (EST) database and two different sets of 
ESTs homologous to the BAC sequences were found. Two full length cDNAs 

30 using 3 X and 5 s RACE and RT-PCR of cDNA from human testis, peripheral blood 
lymphocytes, and the Burkitt's lymphoma cell line Raji were isolated using primers 
made from the different ESTs. The 1.5 kb cDNA of the TCL1 neighboring gene 1 
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(TNG1) (SEQ. ID. NO: 41) contains an open reading frame coding for a protein of 
141 amino acids (SEQ. ID. NO: 42) with the start codon ATG at position 161. The 
2 kb cDNA of TCL1 neighboring gene 2 (TNG2) (SEQ. ID. NO: 43) encodes a 
shorter protein of 110 amino acids (SEQ. ID. NO: 44) with the start codon at 
position 36. Both genes do not show homology to any known genes found in the 
database. Relative positions of the genes and their distances from each other were 
determined by Southern hybridization and pulse field Southern analysis. TNG2 is 
located 8 kb centromeric of TCLlb and TNG1 is only 118 bp centromeric of TNG2, 
TNG1, TNG2 and TCLlb have the same transcriptional orientation, opposite to 
TCL1 (Figure 5A). The TNG1 gene is 4.5 kb (SEQ. ID. NO: 45) in size and 
contains only two exons of 215 and 1239 bp. The TNG2 gene has a size of 8.6 kb 
(SEQ. ID. NO: 46) containing four exons of 134, 136, 157, and 1651 bp, all of 
which are coding. RT-PCR and RACE experiments revealed several alternatively 
spliced RNAs linking various exons of TNG1 and TNG2 to exon 2 of TCLlb 
(Figure 5B). Only one of these RNAs, linking the exon 1 of TNG1 in frame to the 
second exon of TCLlb, contains a new open reading frame encoding a TCLlb 
protein with an alternative N-terminal end. 

The murine Tell locus. 

In order to identify the murine Tcllb gene the murine expressed sequence tag 
(EST) database was searched for sequences homologous to human TCLlb. Three 
sets of ESTs were found that were very similar, but not identical, to each other and 
showed homology to human TCLlb, Additionally, a bacterial artificial chromosome 
(BAC) library was screened and three clones containing murine Tell were obtained. 
PCR analysis of these BAC clones confirmed the presence of all three EST 
sequences. By a combination of RACE and RT-PCR experiments, database analysis 
of EST sequences and sequencing of selected EST clones, full length cDNAs 
corresponding to these sequences were isolated. 

Because of the shared similarity among the cDNAs it was not possible to 
obtain unique probes for each. Thus, the genomic structure of the region by 
conventional methods such as Southern hybridization and pulse field gel analysis 
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could not be determined. Subsequently, the BAC (452-124) was sequenced and the 
position and the exon-intron boundaries of the three cDNAs was determined. 

Further analysis of the region also revealed that it contains two other 
sequence related genes. RT-PCR experiments with specific primers for these two 
5 genes confirmed that they are transcribed. Altogether, five full length cDNAs 
(SEQ. ID. NO: 47-51) were isolated located on murine chromosome 12 centromeric 
to the Igh locus homologous to human TCLlb. Murine Tcllbl - TcllbS cDNAs had 
a length of — 1 kb (SEQ. ID. NO: 47-51, respectively) encoding for proteins 
ranging in size from 117 - 123 amino acids (SEQ. ID. NO: 57-63, respectively). 
10 They share 70-90% nucleic acid homology and 55-75% amino acid identity and 65- 
80% amino acid similarity. The murine Tel lb family members show —25% identity 
and —35% similarity to murine Tell and are 25-30% identical and 30-40% similar 
to human TCLlb. 

The five genes are aligned on murine chromosome 12 (Figure 5C) in the 

15 order Tcllbl, Tcllbl, TcllbS, Tcllb3, and Tcllb4 with distances of 4.5 kb, 9.7 kb, 
9.9 kb, and 6.8 kb, respectively, from each other and 9.8 kb between Tcllb4 and 
TclL The total sizes of the genes are: Tcllbl: 6.9 kb, (SEQ. ID. NO: 52); Tcllbl: 
8.2 kb, (SEQ. ID. NO: 53); Tcllb3 (SEQ. ID. NO: 54); and Tcllb4: 4.6 kb, (SEQ. 
ID. NO: 55); TcllbS: 4.8 kb(SEQ. ID. NO: 56).. The direction of transcription of 

20 Tcllbl - TcllbS is opposite to that of TclL Each of the murine Tcllb genes contains 
four exons of approximately 200, 170, 70, and 590 bp in size. The only exceptions 
are the exons 3 of Tcllbl and Tcllb4, in which a different splicing site leads to a 
transcript 29 bp shorter. In addition, sequences of RT-PCR and RACE products and 
ESTs derived from Genebank showed alternatively spliced cDNAs for Tcllbl and 

25 Tcllb 3, Tcllbl may have a deletion of 73 bp consisting of nearly the complete exon 
3 and the first 6 bp of exon 4. Because this deletion includes the stop codon the 
deduced protein sequence is slightly longer (Tcllbla, SEQ. ID. NO: 58). For 
Tcllb3 an alternative exon 1 was found leading to a shorter protein (Tcllb3a, SEQ. 
ID. NO: 61) with an alternative N-terminal end without homology to other Tcllb 

30 proteins. 

Although the homology of murine Tcllb proteins (SEQ. ID. NO: 57 - 63) to 
human Tel- lb (SEQ. ID. NO: 39) is lower than typically observed between mouse 
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and human homologues (70-100%), the position of the genes on the map, their 
direction of transcription and their exon-intron structure are similar to the human 
TCLlb locus and indicate that these genes are authentic homologues to the human 
TCLlb gene (SEQ. ID. NO: 40). 

5 

Expression of human TNG1 and TNG2. 

??Because TNG1 and TNG2 are located at the same locus as TCL1 and 
TCLlb, it seemed possible that they would exhibit similar expression patterns. To 
investigate this, a series of Northern blot and RT-PCR experiments were performed. 
10 TNG1 and TNG2 are both transcribed in a wide variety of normal tissues (Figure 
6B). The results demonstrate a low level of expression in most tissues examined 
including placenta, kidney, fetal kidney, fetal lung, and fetal heart and all lymphoid 
^ tissues including fetal liver and fetal spleen. The only exception is thymus, which 

U1 only showed transcripts of TNG2, whereas fetal thymus only expressed TNG1 

m 15 (Figure 6B). TCLlb was expressed in the same tissues as TNG1 except thymus, 

E fetal lung, and fetal heart ???(Virgilio, L., et aL, 1998, Proc Natl Acad Sci USA, 

95:3885-3889). Northern blot analysis of normal adult and embryonic tissues was 
p negative for TNG1 and TNG2, probably due to the low level of expression. 

Because of the similarity of transcription patterns of TNG1 and TNG2 to 
U1 20 those of TCL1 and especially TCLlb, and the physical linkage of these genes, the 

p activation of the TNG genes by rearrangements at 14q32.1 was investigated. Figure 

6A demonstrates that all four genes show an identical expression pattern in 
lymphoid tumor cell lines. They are all expressed in early B-tumor cell lines (697, 
Raji, Daudi, and CA-46), but not in postthymic T-cell lines without 14q32.1 
25 rearrangements. Nevertheless, TCL1, TCLlb, TNG1, and TNG2 are all transcribed 
in the T-ALL cell line SupTll carrying a t(14;14)(qll;q32) translocation. Northern 
blot experiments confirmed these transcription patterns (Figure 7). The 1.5 kb 
transcript of TNG1 was found in Burkitt's lymphoma cell lines Daudi and CA-46 
and to a lesser extent also in the Raji cell line and in the T-cell acute lymphocytic 
30 leukemia cell line SupTll (T-ALL) that carries a 14q32.1 translocation. The second 
band, a —2.3 kb transcript is likely to be a product of alternative splicing or 
incompletly processed hnRNA. However, activation of TNG2 in the SupTll cell 
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line was not confirmed by Northern blotting, due to a lower expression level. The 2 
kb TNG2 transcript was detected in all three Burkitt's lymphoma cell lines, but not 
in the pre B-cell line 697 or in any of the T-cell lines investigated. The diffuse 
signal around the bands is due to the various alternative splicing products known to 

5 involve these gene. 

To further study the activation of TNG1 and TNG2 by rearrangements at 
14q32.1 the expression of TNG1 and TNG2 was investigated in four T-cell 
prolymphocytic leukemias (T-PLL) overexpressing TCL1. Figure 6C shows the 
activation of both genes in 2 out of 4 cases. The transcripts of TNG1 and TNG2 

10 were detected after 27 cycles of PCR in these two cases even though at these 
conditions bone marrow and peripheral blood lymphocytes were negative. 
Interestingly, activation of TCLlb in one of the two cases not expressing the TNG 
genes ???(Pekarsky, Y., et al., 1999, Proc Natl Acad Sci USA, 96:2949-2951) was 
previously found. These results indicate that juxtaposition of the TCL1 locus at 

15 14q32.1 to the a/5 locus of the T-cell receptor activates TNG1 and TNG2, as well as 
TCL1 and TCLlb. 

Expression of murine Tcllb genes. 

To investigate the expression pattern of the murine Tcllb genes a series of 

20 RT-PCR experiments was carried out for each of the five genes. After a single 
round of PCR no mRNA expression was found in a series of normal tissue, 
embryonic cDNA libraries, and lymphoid cell lines for any of the five genes. 
However, nested PCR analysis revealed a low level of expression of Tcllb2 in all 
lymphoid cell lines (Figure 8B) and nearly all normal tissues. Further, Tcllb2 

25 expression increased during embryonic developement (7 - 17 days old embryos, 
Figure 8A). A low level of expression was also found for Tcllb 1 in nearly all 
lymphoid cell lines, but not in any other tissues. Tcllb4 only showed a low level of 
expression in testis and in the pro B-cell line NFS 5, which also expressed Tcllb3, 
as confirmed by sequencing (Figure 8). TcllbS expression was not detected in any 

30 tisuue or cell line examined. In comparison to the Tcllb genes, Tell was expressed 
at low levels in testis, 11 and 15 days old embryos and in the thymocyte cell line 
S49-1. Interestingly, murine Tell was not detected in any of the early B-cell lines, 
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although early B-cells show expression of TCL1 in humans (Virgilio, L., et al., 
1994, Proc Natl Acad Sci USA, 95:3885-3889; Virgilio, L., et al., 1998, Proc Natl 
AcadSci USA, 95:3885-3889). 

Since the original three sets of ESTs all derived from a 2 cell embryonic 
5 cDNA library where they make up —0.5% of the total ESTs, cDNA from mouse 
embryonic stem (ES) cells, oocytes, and 2 cell embryos was investigated for the 
expression of Tcllb genes. After a single round of RT-PCR, expression of all five 
Tcllb genes and Tell was found in mouse oocytes and 2 cell embryos at a level 
comparable to that of P-actin expression (Figure 8C). In 2 cell embryos, both 

10 splicing variants of Tcllbl were amplified. Interestingly, in the mouse oocyte cDNA 
library only a shorter transcript of Tell was detected, missing a part of exon 2. Only 
Tell showed expression in ES cells after a single round of PCR, but nested PCR 
revealed a low level of expression also of Tcllbl, Tcllbl, and Tcllb4. The high 
expression of all five Tcllb genes and Tell in mouse oocytes and 2 cell embryos 

15 implies that an important function of these genes occurs in the early embryogenesis 
of the mouse. 

Protein structure of TCL1 family. 

Tell and Mtcpl proteins both consist of an eight-stranded antiparallel (3- 

20 barrel with a hydrophobic core and are predicted to bind small hydrophobic ligands 
(Fu, Z.Q., et al., 1998, Proc Natl Acad Sci USA, 95:3413-3418). Amino acid 
sequence alignment of these proteins with human and mouse Tcllb (Figure 9) shows 
that, despite only an overall 30-40% homology, all 14 amino acids forming the 
hydrophobic core are conserved except Pro36. 10 of these 14 amino acids are 

25 identical in all 10 members of the Tell family, whereas three residues show 
conservative substitutions in some of the proteins (Leu49 -> Val, Leu92 -> He, 
Met 104 -> Leu). Therefore, those residues have an important function in all Tell 
family members. 

Human Tcllb (SEQ. ID. NO: 39) shows a 14 residue insertion (Arg44 - 
30 Glu58) relative to human Tell (Figure 9). Mouse Tcllb has a smaller, 10-11 
residue insertion in the same position. A molecular model was built for human and 
murine Tcllb based on the 35% similarity in amino acid sequence to Tell. In this 
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model, the Tcllb insertion aligns with a non-canonical, 5 residue turn (Lys42 - 
Gln46) observed in the crystal structure of human Tell (Hoh, F., et al., 1998, 
Structure, 6:147-155). The additional residues in human and mouse Tcllb may form 
a surface accessible beta-sheet extension or a flexible loop with conserved charged 
amino acids (Figure 10). 

Discussion 

The present invention discloses the cloning, mapping and expression analysis 
of a novel member of the TCL1 gene family, TCLlb. The TCL1 and TCLlb genes 
are physically linked, show structural similarity, similar expression patterns and 
involvement in T-cell malignancies. Because the remaining two members of the 
TCL1 family are oncogenes (Virgilio, L., et al., 1998, Proc Natl Acad Sci USA, 
95:3885-3889; Gritti, C, et al., 1998, Blood, 92:368-373), it seems likely that 
TCLlb is also an oncogene. It is also likely that TCLlb activation would explain 
cases of T-cell leukemia with amplification at 14q32 without activation of TCLL 

It is possible that two TCL1 genes are the result of duplication, although the 
TCLlb gene is slightly more homologous to the MTCP1 gene at Xq28 than to the 
TCL1 gene. 

Neither the in vivo function of Tell, nor the mechanism(s) of its oncogenic 
potential is known, although its crystal structure (Fu, Z.Q., et al., 1998, Proc Natl 
Acad Sci USA, 95:3413-3418) suggests, it may function as a transporter of small 
molecules, such as retinoids, nucleosides or fatty acids. The same study (Fu, Z.Q., 
et al., 1998, Proc Natl Acad Sci USA, 95:3413-3418) suggested that Tell might 
function as dimer, implying the possibility that Tell and Tcllb might form 
heterodimers. 

Since TCL1 and MTCP1 transgenic mice develop mature T-cell leukemia 
only after 15 months (Virgilio, L., et al., 1998, Proc Natl Acad Sci USA, 95:3885- 
3889; Gritti, C., et al., 1998, Blood, 92:368-373), it will be of considerable interest 
to determine whether TCLlb transgenic mice also develop mature T-cell leukemia 
late and whether TCL1 and TCLlb double transgenic mice develop leukemia faster. 
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Thus, is seems possible that translocations and inversions at 14q32.1 contribute to 
malignant transformation by activating two oncogenes at the same time. 

The present invention discloses the cloning, mapping, and expression 
analysis of the human and murine TCLl/Tcll locus. Human TCL1 and TCLlb genes 
(SEQ. ID. NO: 40); are located between two clusters of chromosomal breakpoints 
and are activated by translocations and inversions at 14q32.1 juxtaposing them to 
regulatory elements of T-cell receptor genes (Pekarsky, Y., et al., 1999, Proc Natl 
Acad Sci USA, 96:2949-2951). Between these two sets of breakpoints two new 
genes were found and characterized, TNG1 and TNG2 (SEQ. ID. NO: 45 and 46, 
respectively);. Both show no homology to any known genes, but similar expression 
patterns to that of TCL1 and TCLlb. Both TNG genes are also activated in the T- 
cell leukemia cell line SupTll carrying a t(14;14) translocation, and in two out of 
four T-PLL samples. Therefore, like TCL1 and TCLlb, these two genes are also 
activated by rearrangements at 14q32.1 involved in T-cell malignancies. Thus, T- 
cell leukemias, in some cases, are induced by the activation of a single gene or in 
others by the cumulative activation of two or more of these four genes, although the 
oncogenic potential of TNG1 and TNG2 remains to be determined. 

To assist in the structural and functional analysis of the human TCL1 gene 
activation, the murine Tell locus was searched for homologues to human TCLlb and 
TNG genes. Five genes homologous to human TCLlb were found. The high shared 
similarity between the five murine Tcllb genes (SEQ. ID. NO: 52-56), not only in 
the exons but also in intronic sequences, implies that they are most likely a result of 
duplications. All five genes are transcribed into mRNA but it remains to be 
determined whether they all code for active proteins or whether some of them might 
be pseudogenes. Their genomic structure, though, is untypical for pseudo genes, 
since it includes introns. 

The five Tcllb genes show different expression patterns, suggesting different 
regulatory elements for each of them, but since the expression of all of the genes in 
all adult tissues and cell lines is very low, the significance of this is not clear. 
Interestingly, the expression of murine Tell and Tcllb genes in lymphoid tissues and 
cell lines is much lower than the expression of their human homologues. The most 
striking feature of murine Tell and Tcllb genes is their very high expression level 
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(up to 0.5% of all mRNA) in mouse oocytes and 2-cell embryos. This finding is 
consistent with the presence of human TCLlb in a syncytiotrophoblast subtracted 
cDNA library (genebank accession # AF 137027), implying a function of murine and 
human TCLlb genes in the early embryogenesis. 

5 The identification of five more murine members of the Tell family provides 

a better understanding of the structural differences and similarities between the Tell 
family of proteins. A comparison of the protein sequences of all members of the 
family including murine and human MTCP1 shows that, although overall 
homologies between the genes are low, the hydrophobic core region as described 

10 by Fu et al. (14) is preserved. This indicates a similar function for all of these 
proteins as transporters of small molecules such as retinoids, nucleosides, or fatty 
acids as suggested previously for Tell and Mtcpl (Hoh, F., et al., 1998, Structure, 
6:147-155; Fu, Z.Q., et al., 1998, Proc Natl Acad Sci USA, 95:3413-3418). 
However, compared to Tell and Mtcpl mouse and human Tel lb proteins show an 

15 insertion which form a surface accessible flexible loop or beta-sheet extension. The 
conserved charged residues in the insert loop play a significant role in mediating 
interactions with other proteins or ligands and also influence the quaternary 
structure of mouse Tcllb(Hoh, F., et al., 1998, Structure, 6:147-155). 

Altogether, murine and human TCL1 loci show significant differences: There 

20 are five murine Tcllb genes compared to one human TCLlb. The homology of 
human and mouse TCLlb is low and the expression levels of murine Tell and Tcllb 
in lymphoid tissues and cell lines is much lower than the expression levels of their 
human equivalents. Moreover, murine homologues of TNG1 and TNG2 were not 
found. This implies that there are also be significant differences in the function of 

25 human and mouse TCL1 loci. Further investigation should lead to a better 
understanding of the role of Tell and Tcllb in normal development and T cell 
leukemia. 

The present invention relates to nucleotide sequences of TCL-lb (SEQ. ID. 
NO: 40);, TNG1 (SEQ. ID. NO: 45); and TNG2 (SEQ. ID. NO: 46); genes and 
30 amino acid sequences of their encoded Tcl-lb (SEQ. ID. NO: 39);, TNG1 (SEQ. 
ID. NO: 42); and TNG2 (SEQ. ID. NO: 44), respectively, proteins, as well as 
derivatives and analogs thereof, and antibodies thereto. The present invention 
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further relates to the use of TCL-lb, TNG1 and TNG2 genes and their encoded 
proteins or derivatives or analogs thereof, and antibodies thereto, in assays for the 
detection and in treatment/prevention of disease states associated with chromosomal 
abnormalities and/or increased expression of TCLlb, TNG1 and TNG2. The present 
5 invention also relates to therapeutic compositions comprising Tel -lb, TNG1 and 
TNG2, proteins, derivatives or analogs thereof, antibodies thereto, nucleic acids 
encoding these proteins, derivatives or analogs, and antisense nucleic acids. 

The TCL-lb, TNG1 and TNG2 gene sequences are from one of many 
different species, including but not limited to, mammalian, bovine, ovine, porcine, 
10 equine, rodent and human, in naturally occurring sequence or in variant form, or 
from any source, whether natural, synthetic, or recombinant. In a specific 
embodiment described herein, the TCL-lb, TNG1 and TNG2 gene sequence are 
y human sequences. The Tel- lb, Tngl and Tngl proteins are those present in one of 

Ul many different species, including but not limited to, mammalian, bovine, ovine, 

15 porcine, equine, rodent and human, in naturally occurring or variant form, or from 
r! any source, whether natural, synthetic, or recombinant. In the specific embodiment 

'4t described herein, the above proteins are human proteins. 

O As defined herein, a Tel- lb, Tngl and Tng2 derivative is a fragment or 

[2 amino acid variant of the Tcl-lb, Tngl and Tng2 sequence (SEQ. ID. NO: 39, 42, 

Ul 20 and 44), respectively, as long as the fragment or amino acid variant is capable of 

□ displaying one or more biological activities associated with the full-length proteins. 

Such biological activities include, but are not limited to, antigenicity, i.e., the ability 
to bind to an their respective antibodies, and immunogenicity, i.e., the ability to 
generate an antibody which is capable able of binding a Tcl-lb, Tngl or Tng2 
25 protein, respectively. 

The invention provides fragments of a Tcl-lb, Tngl or Tng2 protein 
consisting of at least 10 amino acids, or of at least 25 amino acids, or of at least 50 
amino acids, or of at least 114 amino acids. Nucleic acids encoding such derivatives 
or analogs are also within the scope of the invention. A preferred Tel -lb, Tngl or 
30 Tng2 protein variant is one sharing at least 70% amino acid sequence homology, a 
particularly preferred Tel -lb, Tngl or Tng2 protein variant is one sharing at least 
80% amino acid sequence homology and another particularly preferred Tel -lb, 
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Tngl or Tng2 protein variant is one sharing at least 90% amino acid sequence 
homology to the naturally occurring Tel -b, Tngl or Tng2 protein over at least 25, 
at least 50, at least 75 or at least 100 contiguous amino acids of the Tel -lb, Tngl or 
Tng2 amino acid sequence, respectively. As used herein, amino acid sequence 

5 homology refers to amino acid sequences having identical amino acid residues or 
amino acid sequences containing conservative changes in amino acid residues. In 
another embodiment, a Tel -lb, Tngl or Tng2 homologous protein is one that 
shares the foregoing percentages of sequences identical with the naturally occurring 
Tel -lb, Tngl or Tng2 protein, respectively, over the cited lengths of amino acids. 

10 The TCL-bl, TNG1 and TNG2 genes (SEQ. ID. NO: 40, 45, and 46, 

respectively), are located in the region of chromosome 14q32.1 that is located in a 
region banded by two clusters of breakpoints. Due to the similarities between the 
TCL1 and TCL-lb gene structure, sequence and location, their expression patterns 
were compared. In addition the expression patterns of TNG1 and TNG2, which are 

15 located at the same locus as that of TCL1 and TCL-lb, were investigated (Figure 7). 
Expression in normal tissue was mostly negative for TCLlb, Figure 7A. The TCL1 
gene expression, however, was detected in most hematopoietic tissues and both 
TCL1 and TCLlb are expressed in spleen, tonsil, fetal liver, fetal kidney and fetal 
thymus. The TCLlb gene (SEQ. ID. NO: 39) is expressed in a wider variety of 

20 tissues including placenta, kidney and fetal spleen, as shown in Figure 8A. Low 
levels of expression of TNG1 and TNG2 were present in most tissues examined, in 
addition to those for TCLlb, expression in fetal lung, fetal heart and fetal liver. The 
only exception is the thymus, which showed transcripts of 77VG2, whereas fetal 
thymus only expressed TNG1 (Figure 6B). The detection of TCL-lb, TNG1 and 

25 TNG2 mRNA in patient samples, such as biopsied cells and tissues, is used as an 
indicator of the presence of T-cell leukemias and lymphomas associated with certain 
chromosome 14 abnormalities and/or increased expression of Tel- lb, Tngl or Tng2 
proteins. Also, the Tel -lb, Tngl or Tng2 amino acid sequences of the present 
invention (SEQ. ID. NO: 39, 42, and 44, respectively), are used to generate 

30 antibodies useful in immunoassays for the detection or measurement of Tel -lb, 
Tngl or Tng2 proteins in patient samples, respectively. Such antibodies are used in 
diagnostic immunoassays, for the detection or measurement of increased levels of 
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Tel -lb, Tngl or Tng2 proteins, respectively, associated with T-cell leukemias and 
lymphomas. 

In accordance with the present invention, polynucleotide sequences coding 
for a Tcl-lb, Tngl or Tng2 proteins (SEQ. ID. NO: 38, 41, and 43),, derivatives, 

5 e.g. fragment, or analog thereof, are inserted into an appropriate expression vector, 
i.e., a vector which contains the necessary elements for the transcription and 
translation of the inserted protein-coding sequence, for the generation of 
recombinant DNA molecules that direct the expression of a Tcl-lb, Tngl or Tng2 
proteins. Such Tel -lb, Tngl or Tng2 polynucleotide sequences, as well as other 

10 polynucleotides or their complements, are also used in nucleic acid hybridization 
assays, Southern and Northern blot analysis, etc. In a specific embodiment, a human 
TCL-lb, TNG1 or TNG2 gene (SEQ. ID. NO: 40, 45, and 46, respectively), or a 
sequence encoding a functionally active portion of a human TCL-lb, TNG1 or TNG2 
gene, is expressed. In yet another embodiment, a derivative or fragment of a human 

15 TCL-lb TNG1 or TNG2 gene is expressed. 

The TCL-lb, TNG1 and TNG2 Coding Sequences 

In a specific embodiment disclosed herein the invention relates to the nucleic 
acid sequence of the human TCL-lb, TNG1 and TNG2 genes(SEQ. ID. NO: 40, 45, 

20 and 46, respectively). In a preferred, but not limiting, aspect of the invention, a 
human TCL-lb cDNA sequence (SEQ. ID. NO: 38) was identified in the expressed 
sequence tag database (accession no. AA689513) that was homologous to TCL-1 and 
MTCP1, the other members of the TCL1 gene family. Such a sequence was isolated 
and cloned as a 1.2 kilobase full-length cDNA, as described, supra. The TNG1 and 

25 TNG2 genes were also isolated and identified and their sequences compared to the 
expressed sequence database (EST). The 1.5 kilobase cDNA of TNG1 (SEQ. ID. 
NO: 41) contains an open reading frame encoding a protein of 141 amino acids and 
the 2 kilobase TNG2 gene (SEQ. ID. NO: 43) encodes a protein of 110 amnino 
acids, as described, supra. The invention also relates to nucleic acid sequences 

30 hybridizable or complementary to the foregoing sequences, or equivalent to the 
foregoing sequences, in that the equivalent nucleic acid sequences also encode a Tcl- 
lb, Tngl or Tng2 protein product. 
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In a preferred aspect, polymerase chain reaction (PCR) is used to amplify the 
desired nucleic acid sequence in the library by using oligonucleotide primers 
representing known TCL-lb, TNG1 or TNG2 sequences (SEQ. ID. NO: 38, 41, and 
43, respectively). Such primers are used to amplify sequences of interest from an 

5 RNA or DNA source, preferably a cDNA library. PCR is carried out by use of a 
Perkin-Elmer Cetus thermal cycler and Taq polymerase, as is well known by those 
skilled in the art. The DNA being amplified is mRNA or cDNA or genomic DNA 
from any eukaryotic species. Several different degenerate primers are synthesized 
for use in PCR amplification reactions. The stringency of hybridization conditions 

10 used in priming the PCR reactions are also varied in order to allow for greater or 
lesser degrees of nucleotide sequence homology between the TCL-lb, TNG1 or 
TNG2 gene being cloned and that of the TCL-lb, TNG1 or TNG2 genes (SEQ. ID. 
NO: 40, 45, and 46, respectively) of the present invention. 

After successful amplification of a segment of the TCL-lb, TNG1 or TNG2 

15 gene, an allelic, a polymorphic variant, or a species homology of the TCL-lb , TNG1 
or TNG2 gene, that segment is molecularly cloned and sequenced, and utilized as a 
probe to isolate a complete cDNA or genomic clone. This will permit the 
determination of the gene's complete nucleotide sequence, the analysis of its 
expression, and the production of its protein product for functional analysis. This 

20 allows for the identification of additional genes encoding the Tel- lb, Tngl or Tng2, 
respectively, proteins. 

Potentially, any eukaryotic cell can serve as the nucleic acid source for the 
molecular cloning of the TCL-lb, TNG1 or TNG2 gene. The nucleic acid sequences 
encoding TCL-lb, TNG1 or TNG2 gene are isolated from, for example, human, 

25 porcine, bovine, feline, avian, equine, canine, rodent, as well as additional primate 
sources. The DNA is obtained by standard procedures known in the art from, for 
example, cloned DNA (e.g., a DNA "library"), by chemical synthesis, by cDNA 
cloning, or by the cloning of genomic DNA, or fragments thereof, purified from a 
desired cell. (See, for example, Sambrook et al., 1989, Molecular Cloning, A 

30 Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, N.Y.; Glover, D. M. (ed.), 1985, DNA Cloning: A Practical Approach, 
MRL Press, Ltd., Oxford, U.K. Vol. I, II.) A preferred source is cDNA of 
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leukemic cells in which the leukemia is associated with a 14q32.1 chromosomal 
abnormality. Clones derived from genomic DNA contain regulatory and intron 
DNA regions in addition to coding regions, while clones derived from cDNA will 
contain only TCL-lb exon sequences. In a particular embodiment of the present 

5 invention, a genomic sequence is one that is not more than 10 kilobases (kb), or not 
more than 20 kb, or not more than 50 kb or not more than 70 kb. Whatever the 
source, the gene should be molecular ly cloned into a suitable vector for propagation 
of the gene. In a particular embodiment, a preferred source of nucleic acid for the 
isolation of TCL-lb, TNG1 or TNG2 gene sequences is from pre B-cells. 

10 In the molecular cloning of the gene from genomic DNA, DNA fragments 

are generated, some of which will encode the desired gene. The DNA is cleaved at 
specific sites using various restriction enzymes. Alternatively, DNAse in the 
presence of manganese is used to fragment the DNA, or the DNA is physically 
sheared, as for example, by sonication. The linear DNA fragments is then separated 

15 according to size by standard techniques, including but not limited to, agarose and 
poly aery lamide gel electrophoresis and column chromatography. 

Once the DNA fragments are generated, identification of the specific DNA 
fragment containing the desired gene is accomplished in a number of ways. For 
example, a TCL-lb, TNG1 or TNG2 gene (SEQ. ID. NO: 40, 45, and 46, 

20 respectively) of the present invention or its specific RNA, or a fragment thereof, 
such as a probe or primer, is isolated and labeled and then used in hybridization 
assays to detect a generated TCL-1, TNG1 or TNG2 gene (Benton, W. and Davis, 
R., 1977, Science, 196:180; Grunstein, M. And Hogness, D., 1975, Proc Natl 
Acad Sci USA, 72:3961). Those DNA fragments sharing substantial sequence 

25 homology to the probe will hybridize under stringent conditions. The phrase 
"stringent conditions" as used herein refers to those hybridizing conditions that 
(Virgilio, L., et al., 1994, Proc Natl Acad Sci USA, 91:12530-12534) employ low 
ionic strength and high temperature for washing, for example, 0.015 M 
NaCl/0.0015 M sodium citrate/0.1% SDS at 50. degree. C; (Narducci, M.G., et 

30 al., 1997, Cancer Res, 57:5452-5456) employ, during hybridization, a denaturing 
agent such as formamide, for example, 50% (vol/ vol) formamide with 0.1% bovine 
serum albumin/0.1% Ficoll/0.1% polyvinylpyrrolidone/50 mM sodium phosphate 
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buffer at pH 6.5 with 750 mM NaCl, 75 mM sodium citrate at 42. degree. C; or 
(Virgilio, L., et al., 1998, Proc Natl Acad Sci USA, 95:3885-3889) employ 50% 
formamide, S.times.SSC (0.75 M NaCl, 0.075 M sodium pyrophosphate, 
S.times.Denhardt's solution, sonicated salmon sperm DNA (50 g/ml), 0.1% SDS, 

5 and 10% dextran sulfate at 42. degree. C, with washes at 42. degree. C. in 
0.2.times.SSC and 0.1% SDS. 

The appropriate fragment is also identified by restriction enzyme digestion(s) 
and comparison of fragment sizes with those expected according to a known 
restriction map. Further selection is carried out on the basis of the properties of the 

10 gene. Alternatively, the presence of the gene is detected by assays based on the 
physical, chemical, or immunological properties of its expressed product. For 
example, cDNA clones, or genomic DNA clones which hybrid-select the proper 
mRNAs, are selected which produce a protein that has similar or identical 
electrophoretic migration, isolectric focusing behavior, proteolytic digestion maps, 

15 binding activity or antigenic properties as known for Tel- lb. Alternatively, the Tcl- 
lb protein may be identified by binding of labeled antibody to the putatively Tel -lb 
expressing clones, e.g., in an ELISA (enzyme-linked immunosorbent assay)-type 
procedure. 

The TCL-lb, TNG1 or TNG2 gene is also identified by mRNA selection by 
20 nucleic acid hybridization followed by in vitro translation. In this procedure, 
fragments are used to isolate complementary mRNAs by hybridization. Such DNA 
fragments may represent available, purified TCL-lb, TNG1 or TNG2 DNA of 
another TCL-lb , TNG1 or TNG2 gene, respectively. Immunoprecipitation analysis, 
or functional assays, of the in vitro translation products of the isolated products of 
25 the isolated mRNAs identifies the mRNA and, therefore, the complementary DNA 
fragments that contain the desired sequences. In addition, specific mRNAs are 
selected by adsorption of polysomes isolated from cells to immobilized antibodies 
specifically directed against Tel- lb, Tngl or Tng2 protein. A radiolabelled TCL-lb, 
TNG1 or TNG2 cDNA is synthesized using the selected mRNA (from the adsorbed 
30 polysomes) as a template. The radiolabelled mRNA or cDNA is then used as a 
probe to identify the TCL-lb, TNG1 or TNG2 DNA fragments, respectively, from 
among other genomic DNA fragments. 
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Alternatives to isolating the TCL-lb, TNG1 or TNG2 genomic DNA include, 
but are not limited to, chemically synthesizing the gene sequence itself from a 
known sequence or making cDNA to the mRNA which encodes the Tel- lb, Tngl or 
Tng2, respectively, protein. For example, RNA useful in cDNA cloning of the TCL- 

5 lb , TNG1 or TNG2 gene is isolated from cells which express Tel- lb, Tngl or 
Tng2, respectively, e.g., pre-B acute lymphoblastic leukemia cells or endemic 
Burkitt's lymphoma cells which express cell surface IgM and do not secrete 
immunoglobulin. Other methods are known to those of skill in the art and are within 
the scope of the invention. 

10 The identified and isolated gene is then inserted into an appropriate cloning 

vector. A large number of vector-host systems known in the art may be used. 
Possible vectors include, but are not limited to, plasmids or modified viruses, but 
the vector system must be compatible with the host cell used. Such vectors include, 
but are not limited to, bacteriophages such as lambda derivatives, or plasmids such 

15 as PBR322 or pUC plasmid derivatives. The insertion into a cloning vector can, for 
example, be accomplished by ligating the DNA fragment into a cloning vector 
which has complementary cohesive termini. However, if the complementary 
restriction sites used to fragment the DNA are not present in the cloning vector, the 
ends of the DNA molecules may be enzymatically modified. Alternatively, any site 

20 desired may be produced by ligating nucleotide sequences (linkers) onto the DNA 
termini; these ligated linkers comprise specific chemically synthesized 
oligonucleotides encoding restriction endonuclease recognition sequences. In an 
alternative method, the cleaved vector and TCL-lb , TNG1 or TNG2 gene is 
modified by homopolymeric tailing. Recombinant molecules are introduced into host 

25 cells via transformation, transfection, infection, electroporation, or other methods 
known to those of skill in the art, so that many copies of the gene sequence are 
generated. 

In an alternative method, the desired gene is identified and isolated after 
insertion into a suitable cloning vector in a "shot gun" approach. Enrichment for the 
30 desired gene, for example, by size fractionization, is done before insertion into the 
cloning vector. 
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In specific embodiments, transformation of host cells with recombinant DNA 
molecules that incorporate the isolated TCL-lb , TNG1 or TNG2 gene, cDNA, or 
synthesized DNA sequence enables generation of multiple copies of the gene. Thus, 
the gene is obtained in large quantities by growing transformants, isolating the 

5 recombinant DNA molecules from the transformants and, when necessary, 
retrieving the inserted gene from the isolated recombinant DNA. 

Oligonucleotides containing a portion of the TCL-lb, TNG1 or TNG2 coding 
or non-coding sequences, or which encode a portion of the Tel- lb, Tngl or Tng2, 
respectively, protein (e.g., primers for use in PCR) are synthesized by standard 

10 methods commonly known in the art. Such oligonucleotides preferably have a size 
in the range of 8 to 25 nucleotides. In a particular embodiment herein, such 
oligonucleotides have a size in the range of 15 to 25 nucleotides or 18 to 25 
nucleotides. 

15 Expression of the TCL-lb , TNG1 or TNG2 Gene 

In accordance with the present invention, polynucleotide sequences coding 
for a Tel- lb, Tngl or Tng2 protein, derivative, e.g. fragment, or analog thereof, 
can be inserted into an appropriate expression vector, i.e., a vector which contains 

20 the necessary elements for the transcription and translation of the inserted protein- 
coding sequence, for the generation of recombinant DNA molecules that direct the 
expression of a Tel -lb , Tngl or Tng2 protein. Such TCL-lb, TNG1 or TNG2, 
respectively, polynucleotide sequences, as well as other polynucleotides or their 
complements, may also be used in nucleic acid hybridization assays, Southern and 

25 Northern blot analysis, etc. In a specific embodiment, a human TCL-lb, TNG1 or 
TNG2 gene, or a sequence encoding a functionally active portion of a human TCL- 
lb , TNG1 or TNG2 gene is expressed. In yet another embodiment, a derivative or 
fragment of a human TCL-lb , TNG1 or TNG2 gene is expressed. 

Due to the inherent degeneracy of the genetic code, other DNA sequences 

30 which encode substantially the same or a functionally equivalent Tel- lb amino acid 
sequence, is within the scope of the invention. Such DNA sequences include those 
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which are capable of hybridizing to the human TCL-lb, TNG1 or TNG2 sequence 
under stringent conditions. 

Altered DNA sequences which are used in accordance with the invention 
include deletions, additions or substitutions of different nucleotide residues resulting 
5 in a sequence that encodes the same or a functionally equivalent gene product. The 
gene product itself may contain deletions, additions or substitutions of amino acid 
residues within a TCL-lb, TNG1 or TNG2 sequence, which result in a silent change, 
thus producing a functionally equivalent Tel- lb, Tngl or Tng2, respectively, 
protein. Such amino acid substitutions are made on the basis of similarity in 
10 polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic 
nature of the residues involved. For example, negatively charged amino acids 
include aspartic acid and glutamic acid; positively charged amino acids include 
¥ lysine and arginine; amino acids with uncharged polar head groups having similar 

Ul hydrophilicity values include the following: leucine, isoleucine, valine; glycine, 

Ep 15 alanine; asparagine, glutamine; serine, threonine; phenylalanine, tyrosine. 

J; The DNA sequences of the invention are engineered in order to alter a TCL- 

lb , TNG1 or TNG2 coding sequence for a variety of ends, including but not limited 
p to alterations which modify processing and expression of the gene product. For 

f: example, mutations introduced using techniques which are well known in the art, 

Ul 20 e.g., site-directed mutagenesis, to insert new restriction sites, to alter 

□ phosphorylation, etc. 

In another embodiment of the invention, a TCL-lb , TNG1 or TNG2 gene 
sequence or a derivative thereof is ligated to a non- TCL-lb , non-TNGl or non- 
TNG2 gene sequence to encode a chimeric fusion protein. A fusion protein is 
25 engineered to contain a cleavage site located between a Tel -lb, Tngl or Tng2, 
respectively, sequence and the non-Tcl-lb, non-Tngl or non-Tng2, respectively, 
protein sequence, so that the Tel -lb, Tngl or Tng2 protein, respectively may be 
cleaved away from the non- Tel -lb, non-Tngl or non-Tng2, respectively, moiety. 
In a specific embodiment, the Tel- lb, non-Tngl or non-Tng2, respectively, amino 
30 acid sequence present in the fusion protein consists of at least 10 contiguous amino 
acids, at least 25 contiguous amino acids, at least 50 contiguous amino acids, at least 
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75 contiguous amino acids, at least 100 contiguous amino acids, or at least 114 
amino acids of the Tel -lb, non-Tngl or non-Tng2, protein sequence. 

In an alternate embodiment of the invention, the coding sequence of a Tel - 
lb, Tngl or Tng2, is synthesized in whole or in part, using chemical methods well 

5 known in the art. See, for example, Caruthers et al., 1980, Nuc. Acids Res. Symp. 
Ser. 7:215-233; Crea and Horn, 1980, Nuc. Acids Res. 9(10):2331; Matteucci and 
Caruthers, 1980, Tetrahedron Letters 21:719; and Chow and Kempe, 1981, Nuc. 
Acids Res. 9(12):2807-2817. Alternatively, the protein itself is produced using 
chemical methods to synthesize a Tel -lb, Tngl or Tng2 amino acid sequence in 

10 whole or in part. For example, peptides are synthesized by solid phase techniques, 
cleaved from the resin, and purified by preparative high performance liquid 
chromatography, (e.g., see Creighton, 1983, Proteins Structures And Molecular 
Principles, W. H. Freeman and Co., N.Y. pp. 50-60). The composition of the 
synthetic peptides may be confirmed by amino acid analysis or sequencing (e.g., the 

15 Edman degradation procedure; see Creighton, 1983, Proteins, Structures and 
Molecular Principles, W. H. Freeman and Co., N.Y., pp. 34-49. 

In order to express a biologically active Tel -lb, Tngl or Tng2 protein or 
derivative thereof, a polynucleotide sequence encoding a Tel- lb, Tngl or Tng2, 
resepectively, protein, or a derivative thereof, is inserted into an appropriate 

20 expression vector, i.e., a vector which contains the necessary elements for the 
transcription and translation of the inserted coding sequence. The TCL-lb, TNG1 or 
TNG2 gene products, as well as host cells or cell lines transfected or transformed 
with recombinant TCL-lb, TNG1 or TNG2, respecitvely, expression vectors, are 
used for a variety of purposes. These include, but are not limited to, generating 

25 antibodies (i.e., monoclonal or polyclonal) that immunospecifically bind a Tel -lb 
protein. Anti- Tel -lb, antig-Tngl or anti-Tng2 antibodies are used in detecting or 
measuring levels of a Tel -lb, Tngl or Tng2, respectively, protein in patient 
samples. 

30 Expression Systems 
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Methods which are well known to those skilled in the art are used to 
construct expression vectors containing a TCL-lb, TNG1 or TNG2 coding sequence 
and appropriate transcriptional/translational control signals. These methods include 
in vitro recombinant DNA techniques, synthetic techniques and in vivo 

5 recombination/genetic recombination. See, for example, the techniques described in 
Sambrook et al., 1989, Molecular Cloning, A Laboratory Manual 2d ed., Cold 
Spring Harbor Laboratory, N.Y. and Ausubel et al., 1989, Current Protocols in 
Molecular Biology, Greene Publishing Associates and Wiley Interscience, N.Y. 

A variety of host-expression vector systems are utilized to express a TCL-lb, 

10 TNG1 or TNG2 coding sequence. These include, but are not limited to, 
microorganisms such as bacteria transformed with recombinant bacteriophage DNA, 
plasmid DNA or cosmid DNA expression vectors containing a TCL-lb, TNG1 or 
TNG2 coding sequence; yeast transformed with recombinant yeast expression 
vectors containing a TCL-lb , TNG1 or TNG2 coding sequence; insect cell systems 

15 infected with recombinant virus expression vectors (e.g., baculo virus) containing an 
TCL-lb f TNG1 or TNG2 coding sequence; plant cell systems infected with 
recombinant virus expression vectors (e.g., cauliflower mosaic virus, CaMV; 
tobacco mosaic virus, TMV) or transformed with recombinant plasmid expression 
vectors (e.g., Ti plasmid) containing a TCL-lb, TNG1 or TNG2 coding sequence; or 

20 animal cell systems. The expression elements of these systems vary in their strength 
and specificities. Depending on the host/ vector system utilized, any of a number of 
suitable transcription and translation elements, including constitutive and inducible 
promoters, are used in the expression vector. For example, when cloning in 
bacterial systems, inducible promoters such as pL of bacteriophage. lambda., plac, 

25 ptrp, ptac (ptrp-lac hybrid promoter) and the like are used; when cloning in insect 
cell systems, promoters such as the baculovirus polyhedrin promoter are used; when 
cloning in plant cell systems, promoters derived from the genome of plant cells 
(e.g., heat shock promoters; the promoter for the small subunit of RUBISCO; the 
promoter for the chlorophyll a/b binding protein) or from plant viruses (e.g., the 

30 35S RNA promoter of CaMV; the coat protein promoter of TMV) are used; when 
cloning in mammalian cell systems, promotersderived from the genome of 
mammalian cells (e.g., metallothionein promoter) or from mammalian viruses (e.g., 
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the adenovirus late promoter; the vaccinia virus 7.5 K promoter) are used; when 
generating cell lines that contain multiple copies of a TCL-lb, TNG1 or TNG2 
DNA, SV40-, BPV- and EBV-based vectors are used with an appropriate selectable 
marker. 

5 In bacterial systems, a number of expression vectors are advantageously 

selected depending upon the use intended for the Tcl-lb, Tngl or Tng2 protein 
expressed. For example, when large quantities of Tel -lb, Tngl or Tng2 protein are 
produced for the generation of antibodies, vectors which direct the expression of 
high levels of fusion protein products that are readily purified are desirable. Such 

10 vectors include, but are not limited to, the E. coli expression vector pUR278 
(Ruther, et al., 1983, EMBO 7, 2:1791), in which the TCL-lb, TNG1 or TNG2 
coding sequence are ligated into the vector in frame with the lac Z coding region so 
that a hybrid AS-lac Z protein is produced; pIN vectors (Inouye & Inouye, 1985, 
Nucleic Acids Res, 13:3101-3109; Van Heeke & Schuster, 1989, J Biol Chem, 

15 264:5503-5509); and the like. pGEX vectors are also used to express foreign 
polypeptides as fusion proteins with glutathione S-transferase (GST). In general, 
such fusion proteins are soluble and easily purified from lysed cells by adsorption to 
glutathione-agarose beads followed by elution in the presence of free glutathione. 
The PGEX vectors are designed to include thrombin or factor Xa protease cleavage 

20 sites so that the cloned polypeptide of interest is released from the GST moiety. 

In yeast, a number of vectors containing constitutive or inducible promoters 
are used. For a review see, Current Protocols in Molecular Biology, Vol. 2, 1988, 
Ed. Ausubel et al., Greene Publish. Assoc. & Wiley Interscience, Ch. 13; Grant et 
al., 1987, Expression and Secretion Vectors for Yeast, in Methods in Enzymology, 

25 Ed. Wu & Grossman, 1987, Acad. Press, N.Y. 153:516-544; Glover, 1986, DNA 
Cloning. Vol. II, IRL Press, Wash., D.C., Ch. 3; and Bitter, 1987, Heterologous 
Gene Expression in Yeast, Methods in Enzymology, Eds. Berger & Kimmel, Acad. 
Press, N.Y. 152:673-684; and The Molecular Biology of the Yeast Saccharomyces, 
1982, Eds. Strathern et al., Cold Spring Harbor Press, Vols. I and II. 

30 In cases where plant expression vectors are used, the expression of a TCL- 

lb, TNG1 or TNG2 coding sequence is driven by any of a number of promoters. For 
example, viral promoters such as the 35S RNA and 19S RNA promoters of CaMV 
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(Brisson, et aL, 1984, Nature, 310:511-514), or the coat protein promoter of TMV 
(Takamatsu, et al., 1987, EMBO J, 6:307-311) are used; alternatively, plant 
promoters such as the small subunit of RUBISCO (Coruzzi, et al., 1984, EMBO J, 
3:1671-1680; Broglie, et aL, 1984, Science, 224:838-843); or heat shock 
5 promoters, e.g., soybean hspl7.5-E or hspl7.3-B (Gurley, et aL, 1986, Mol Cell 
Biol, 6:559-565) are used. These constructs are introduced into plant cells using Ti 
plasmids, Ri plasmids, plant virus vectors, direct DNA transformation, 
microinjection, electroporation, etc. For reviews of such techniques see, for 
example, Weissbach & Weissbach, 1988, Methods for Plant Molecular Biology, 
10 Academic Press, N.Y., Section VIII, pp. 421-463; and Grierson & Corey, 1988, 
Plant Molecular Biology, 2d Ed., Blackie, London, Ch. 7-9. 

An alternative expression system which could be used to express a TCL-lb, 
TNG1 or TNG2 gene is an insect ystem. In one such system, Autographa californica 
nuclear polyhedrosis virus (AcNPV) is used as a vector to express foreign genes. 
15 The virus grows in Spodoptera frugiperda cells. A TCL-lb, TNG1 or TNG2 coding 
sequence is cloned into non-essential regions (for example the polyhedrin gene) of 
the virus and placed under control of an AcNPV promoter (for example, the 
polyhedrin promoter). Successful insertion of a TCL-lb, TNG1 or TNG2 coding 
sequence will result in inactivation of the polyhedrin gene and production of non- 
20 occluded recombinant virus (i.e., virus lacking the proteinaceous coat coded for by 
the polyhedrin gene). These recombinant viruses are then used to infect Spodoptera 
frugiperda cells in which the inserted gene is expressed, (e.g., see Smith, et aL, 
1983, J Virol, 46:584; Smith, U.S. Pat. No. 4,215,051). 

In mammalian host cells, a number of viral based expression systems are 
25 utilized. In cases where an adenovirus is used as an expression vector, a TCL-lb, 
TNG1 or TNG2 coding sequence is ligated to an adenovirus transcription/translation 
control complex, e.g., the late promoter and tripartite leader sequence. This 
chimeric gene is then inserted in the adenovirus genome by in vitro or in vivo 
recombination. Insertion in a non-essential region of the viral genome (e.g., region 
30 El or E3) will result in a recombinant virus that is viable and capable of expressing 
a TCL-lb in infected hosts, (e.g., see Logan & Shenk, 1984, Proc. Natl. Acad. Sci. 
U.S.A. 81:3655-3659). Alternatively, the vaccinia 7.5 K promoter are used. (See, 
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e.g., Mackett, etal., 1982, Proc Natl Acad Sci USA, 79:7415-7419; Mackett, etal., 
1984, J Virol, 49:857-864; Panicali, et al., 1982, Proc Natl Acad Sci USA, 
79:4927-4931). 

Specific initiation signals may also be required for efficient translation of an 

5 inserted TCL-lb, TNG1 or TNG2 coding sequences. These signals include the ATG 
initiation codon and adjacent sequences. In cases where an entire TCL-lb, TNG1 or 
TNG2 gene, including its own initiation codon and adjacent sequences, is inserted 
into the appropriate expression vector, no additional translational control signals 
may be needed. However, in cases where only a portion of a TCL-lb, TNG1 or 

10 TNG2 coding sequence is inserted, lacking the 5' end, exogenous translational 
control signals, including the ATG initiation codon, must be provided. Furthermore, 
the initiation codon must be in phase with the reading frame of a TCL-lb, TNG1 or 
TNG2 coding sequence to ensure translation of the entire insert. These exogenous 
translational control signals and initiation codons of are of a variety of origins, both 

15 natural and synthetic. The efficiency of expression are enhanced by the inclusion of 
appropriate transcription enhancer elements, transcription terminators, etc. (see 
Bittner, et al., 1987, Methods in Enzymol, 153:516-544). 

In addition, a host cell strain is chosen which modulates the expression of the 
inserted sequences, or modifies and processes the gene product in the specific 

20 fashion desired. Such modifications (e.g., phosphorylation) and processing (e.g., 
cleavage) of protein products may be important for the function of the protein. 
Different host cells have characteristic and specific mechanisms for the post- 
radiational processing and modification of proteins. Appropriate cells lines or host 
systems are chosen to ensure the correct modification and processing of the foreign 

25 protein expressed. To this end, eukaryotic host cells which possess the cellular 
machinery for proper processing of the primary transcript, and phosphorylation of 
the gene product are used. Such mammalian host cells include but are not limited to 
CHO, VERO, BHK, HeLa, COS, MDCK, 293, WI38, etc. 

For long-term, high-yield production of recombinant proteins, stable 

30 expression is preferred. For example, cell lines which stably express a Tel -lb, 
Tngl or Tng2 protein are engineered. Rather than using expression vectors which 
contain viral origins of replication, host cells are transformed with TCL-lb, TNG1 
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or TNG2 DNA, respectively, controlled by appropriate expression control elements 
(e.g., promoter, enhancer, sequences, transcription terminators, polyadenylation 
sites, etc.), and a selectable marker. Following the introduction of foreign 

DNA, engineered cells are allowed to grow for 1-2 days in an enriched media, and 

5 are then switched to a selective media. The selectable marker in the recombinant 
plasmid confers resistance to the selection and allows cells to stably integrate the 
plasmid into their chromosomes and grow to form foci which in turn are cloned and 
expanded into cell lines. This method is advantageously used to engineer cell lines 
which express a Tcl-lb, Tngl or Tng2, respectively, protein. The present invention 

10 provides a method for producing a recombinant Tel -lb, Tngl or Tng2 protein 
comprising culturing a host cell transformed with a recombinant expression vector 
encoding a Tel -lb, Tngl or Tng2, respectively, protein such that the Tel -lb, Tngl 
or Tng2 protein is expressed by the cell and recovering the expressed Tel -lb, Tngl 
or Tng2 protein. 

15 A number of selection systems are used, including, but not limited to, the 

herpes simplex virus thymidine kinase (Wigler, et al., 1977, Cell, 11:223), 
hypoxanthine-guanine phosphoribosyltransferase (Szybalska & Szybalski, 1962, 
Proc Natl Acad Sci USA, 48:2026), and adenine phosphoribosyltransferase (Lowy, 
et al., 1980, Cell, 22:817) genes can be employed in tk-, hgprt- or aprt-cells, 

20 respectively. Also, antimetabolite resistance is used as the basis of selection for 
dhfr, which confers resistance to methotrexate (Wigler, et al., 1980, Natl Acad Sci 
USA, 77:3567; O'Hare, etal., 1981, Proc Natl Acad Sci USA, 78:1527); gpt, which 
confers resistance to mycophenolic acid (Mulligan & Berg, 1981, Proc Natl Acad 
Sci USA, 78:2072); neo, which confers resistance to the aminoglycoside G-418 

25 (Colberre-Garapin, et al., 1981, / Mol Biol, 150:1); and hygro, which confers 
resistance to hygromycin (Santerre, et al., 1984, Gene, 30:147). Recently, 
additional selectable genes have been described, namely trpB, which allows cells to 
utilize indole in place of tryptophan; hisD, which allows cells to utilize histinol in 
place of histidine (Hartman & Mulligan, 1988, Proc Natl Aca. Sci USA, 85:8047); 

30 and ODC (ornithine decarboxylase) which confers resistance to the ornithine 
decarboxylase inhibitor, 2-(difluoromethyl)-DL-ornithine, DFMO (McConlogue, 
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L., 1987, In: Current Communications in Molecular Biology, Cold Spring Harbor 
Laboratory, Ed.)- 



Identification of Transfectants or Transformants That Express TcMb, Tngl or Tng2 

5 

The host cells which contain the coding sequence and which express the 
biologically active gene product are identified by at least four general approaches; 
(a) DNA-DNA or DNA-RNA hybridization; (b) the presence or absence of 
"marker" gene functions; (c) assessing the level of transcription as measured by the 
10 expression of TCL-lb, TNG1 or TNG2 mRNA transcripts in the host cell; and (d) 
detection of the gene product as measured by immunoassay or by its biological 
activity. 

In the first approach, the presence of the TCL-lb, TNG1 or TNG2 coding 
sequence inserted in the expression vector is detected by DNA-DNA or DNA-RNA 
15 hybridization using probes comprising nucleotide sequences that are homologous to 
the TCL-lb, TNG1 or TNG2 coding sequence, respectively, or portions or 
derivatives thereof. 

In the second approach, the recombinant expression vector/host system is 
identified and selected based upon the presence or absence of certain "marker" gene 

20 functions (e.g., thymidine kinase activity, resistance to antibiotics, resistance to 
methotrexate, transformation phenotype, occlusion body formation in baculovirus, 
etc.). For example, if the human TCL-lb, TNG1 or TNG2 coding sequence is 
inserted within a marker gene sequence of the vector, recombinant cells containing 
the TCL-lb, TNG1 or TNG2 coding sequence are identified by the absence of the 

25 marker gene function. Alternatively, a marker gene is placed in tandem with a TCL- 
lb, TNG1 or TNG2 sequence under the control of the same or different promoter 
used to control the expression of the TCL-lb, TNG1 or TNG2 coding sequence. 
Expression of the marker in response to induction or selection indicates expression 
of the TCL-lb, TNG1 or TNG2 coding sequence. 

30 In the third approach, transcriptional activity of a TCL-lb, TNG1 or TNG2 

gene is assessed by hybridization assays. For example, RNA is isolated and 
analyzed by Northern blot using a probe having sequence homology to a TCL-lb, 
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TNG1 or TNG, respectively, coding sequence or transcribed noncoding sequence or 
particular portions thereof. Alternatively, total nucleic acid of the host cell are 
extracted and quantitatively assayed for hybridization to such probes. 

In the fourth approach, the levels of a Tcl-lb, Tngl or Tng2 protein product 
is assessed immunologically, for example by Western blots, immunoassays such as 
radioimmuno-precipitation, enzyme-linked immunoassays and the like. 

Purification of the Expressed Gene Product 

Once a recombinant which expresses the TCL-lb, TNG1 or TNG2 gene 
sequence is identified, the gene product is analyzed. This is achieved by assays 
based on the physical or functional properties of the product, including radioactive 
labelling of the product followed by analysis by gel electrophoresis, immunoassay, 
or other detection methods known to those of skill in the art. 

Once the Tcl-lb, Tngl or Tng2 protein is identified, it is isolated and 
purified by standard methods including chromatography (e.g., ion exchange, 
affinity, and sizing column chromatography), centrifugation, differential solubility, 
or by any other standard technique for the purification of proteins. The functional 
properties are evaluated using any suitable assay. 

Alternatively, once a Tcl-lb, Tngl or Tng2 protein produced by a 
recombinant is identified, the amino acid sequence of the protein is deduced from 
the nucleotide sequence of the chimeric gene contained in the recombinant. As a 
result, the protein is synthesized by standard chemical methods known in the art 
(e.g., see Hunkapiller, etal., 1984, Nature, 310:105-111). 

In a specific embodiment of the present invention, such Tel -lb, Tngl or 
Tng2 proteins, whether produced by recombinant DNA techniques or by chemical 
synthetic methods, include, but are not limited to, those containing, as a primary 
amino acid sequence, all or part of the amino acid sequence, substantially, as in 
SEQ. ID. NO: 39, 42, and 44, respectively, as well as fragments and other 
derivatives, and analogs thereof. 

Generation of Antibodies to Tcl-lb, Tngl or Tng2 
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According to the invention, Tcl-lb, Tngl or Tng2 protein, its fragments or 
other derivatives, or analogs thereof, are used as an immunogen to generate 
antibodies which recognize such an immunogen. Such antibodies include but are not 
5 limited to polyclonal, monoclonal, chimeric, single chain, Fab fragments, and an 
Fab expression library. In a specific embodiment, antibodies to a human Tel -lb, 
Tngl or Tng2, respectively, protein are produced. 

Various procedures known in the art are used for the production of 
polyclonal antibodies to a Tel -lb, Tngl or Tng2 protein or derivative or analog. 
10 For the production of antibody, various host animals are immunized by injection 
with the native Tel -lb, Tngl or Tng2 protein, or a synthetic version, or derivative 
(e.g., fragment) thereof, including but not limited to rabbits, mice, rats, etc. 
J Various adjuvants are used to increase the immunological response, depending on 

1 the host species, and including but not limited to Freund's (complete and 

! 15 incomplete), mineral gels such as aluminum hydroxide, surface active substances 

such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole 
limpet hemocyanins, dinitrophenol, and potentially useful human adjuvants such as 
I BCG (bacille Calmette-Guerin) and corynebacterium parvum. 

; In a specific example, the entire protein product of the TCL-lb,TNGl or 

20 TNG2 gene expressed in bacteria was used to immunize rabbits against Tel -lb, 
Tngl or Tng2, respectivley. Such antibodies recognized the Tel -lb, Tngl or Tng2 
protein, respectively, in a variety of leukemia and lymphoma cells by Western Blot 
and by immunoprecipitation. 

For preparation of monoclonal antibodies directed toward a Tel -lb , Tngl 
25 or Tng2 protein sequence (SEQ. ID. NO: 39, 42, 44, respectively) or analog 
thereof, any technique which provides for the production of antibody molecules by 
continuous cell lines in culture are used. For example, the hybridoma technique 
originally developed by Kohler and Milstein (1975, Nature, 256:495-497), as well 
as the trioma technique, the human B-cell hybridoma technique (Kozbor et al., 
30 1983, Immunology Today, 4:72), and the EBV-hybridoma technique to produce 
human monoclonal antibodies (Cole, et al., 1985, in Monoclonal Antibodies and 
Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). In an additional embodiment of the 
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invention, monoclonal antibodies are produced in germ-free animals utilizing recent 
technology (PCT/US90/02545). According to the invention, human antibodies are 
used and are obtained by using human hybridomas (Cote, et al., 1983, Proc Natl 
Acad Sci USA, 80:2026-2030) or by transforming human B cells with EBV virus in 

5 vitro (Cole et al., 1985, in Monoclonal Antibodies and Cancer Therapy, Alan R. 
Liss, pp. 77-96). In fact, according to the invention, techniques developed for the 
production of "chimeric antibodies" (Morrison, et al., 1984, Proc Natl Acad Sci 
USA, 81:6851-6855; Neuberger, et al., 1984, Nature, 312:604-608; Takeda et al., 
1985, Nature, 314:452-454) by splicing the genes from a mouse antibody molecule 

10 specific for Tcl-lb, Tngl or Tng2 proteins together with genes from a human 
antibody molecule of appropriate biological activity is used; such antibodies are 
within the scope of this invention. 

According to the invention, techniques described for the production of single 
chain antibodies (U.S. Pat. No. 4,946,778) are adapted to produce Tel -lb, Tngl or 

15 Tng2-specific single chain antibodies. An additional embodiment of the invention 
utilizes the techniques described for the construction of Fab expression libraries 
(Huse, et al., 1989, Science, 246:1275-1281) to allow rapid and easy identification 
of monoclonal Fab fragments with the desired specificity for Tcl-lb, Tngl or Tng2 
proteins, derivatives, or analogs. 

20 Antibody fragments which contain the idiotype of the molecule are generated 

by known techniques. For example, such fragments include, but are not limited to,: 
the F(ab').sub.2 fragment which is produced by pepsin digestion of the antibody 
molecule; the Fab' fragments which are generated by reducing the disulfide bridges 
of the F(ab').sub.2 fragment, and the Fab fragments which are generated by treating 

25 the antibody molecule with papain and a reducing agent. 

In the production of antibodies, screening for the desired antibody is 
accomplished by techniques known in the art, e.g. ELISA (enzyme-linked 
immunosorbent assay). For example, to select antibodies which recognize a specific 
domain of a Tel -lb, Tngl orTng2 protein, one assays the generated hybridomas for 

30 a product which binds to a Tel -lb, Tngl or Tng2, respectively, fragment 
containing such domain. For selection of an antibody specific to human Tel -lb, 
Tngl or Tng2, one selects on the basis of positive binding to human Tel -lb, Tngl 
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or Tng2, respectiv ely, and a lack of binding to, for example, mouse Tel -lb, Tngl 
or Tng2. 

The foregoing antibodies are used in methods known in the art relating to the 
localization and activity of the protein sequences of the invention, e.g., for imaging 
5 these proteins, measuring levels thereof in appropriate physiological samples, etc. 

Structure of the Tel- lb, Tngl and Tng2 Gene and Protein 

The structure of the Tel- lb, Tngl and Tng2 gene and protein is analyzed by 
10 various methods known in the art. 

Genetic Analysis 

The cloned DNA or cDNA corresponding to the TCL-lb, TNG1 or TNG2 

15 gene is analyzed by methods including but not limited to Southern hybridization 
(Southern, E. M., 1975, J Mol Biol, 98:503-517), Northern hybridization (see, e.g., 
Freeman, et aL, 1983, Proc Natl Acad Sci USA, 80:4094-4098), restriction 
endonuclease mapping (Maniatis, T., 1982, Molecular Cloning, A Laboratory, Cold 
Spring Harbor, N.Y.), and DNA sequence analysis. Polymerase chain reaction 

20 (PCR; U.S. Pat. Nos. 4,683,202, 4,683,195, and 4,889,818; Proc Natl Acad Sci 
USA 85:7652-7656; Ochman, et al., 1988, Genetics, 120:621-623; Loh, et al., 
1989, Science, 243:217-220) followed by Southern hybridization with a TCL-lb, 
TNG1 or TNG2 specific probe allows the detection of the TCL-lb, TNG1 or TNG2 
gene, resepectively, in DNA from various cell types. In one embodiment, Southern 

25 hybridization is used to determine the genetic linkage of TCL-lb, TNG1 or TNG2, 
respectively. PCR followed by hybridization assay is also used to detect or measure 
TCL-lb f TNG1 or TNG2 RNA, respectively, or 14q32.1 chromosomal 
abnormalities. Northern hybridization analysis is used to determine the expression 
levels of the TCL-lb , TNG1 or TNG2 gene. Various cell types, at various states of 

30 development or activity are tested for TCL-lb expression. The stringency of the 
hybridization conditions for both Southern and Northern hybridization, or dot blots, 
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are manipulated to ensure detection of nucleic acids with the desired degree of 
relatedness to the specific TCL-lb, TNG1 or TNG2 probe respectively, used. 

Restriction endonuclease mapping is used to roughly determine the genetic 
structure of the TCL-lb, TNG1 or TNG2 gene. Restriction maps derived by 
restriction endonuclease cleavage are confirmed by DNA sequence analysis. 

DNA sequence analysis is performed by any techniques known in the art, 
including, but not limited to, the method of Maxam and Gilbert (1980, Meth 
Enzymol, 65:499-560), the Sanger dideoxy method (Sanger, et al., 1977, Proc Natl 
Acad Sci USA, 74:5463), the use of T7 DNA polymerase (Tabor and Richardson, 
U.S. Pat. No. 4,795,699), or use of an automated DNA sequenator (e.g., Applied 
Biosystems, Foster City, Calif.). The cDNA sequence of a representative TCL-lb , 
TNG1 or TNG2 gene comprises the sequence substantially as disclosed herein 
(SEQ. ID. NO: 38, 41 and 43, respectively). 

Protein Analysis 

The amino acid sequence of the Tel- lb, Tngl and Tng2 protein are derived 
by deduction from the DNA sequence, or alternatively, by direct sequencing of the 
protein, e.g., with an automated amino acid sequencer. The amino acid sequence of 
a representative Tel- lb, Tngl and Tng2 protein comprises the sequence substantially 
as depicted in SEQ ID NO: 39, 42, and 44, respectively, with the representative 
mature protein that is shown by amino acid numbers 1-128, 1-141, and 1-110, 
respectively. 

The Tel -lb, Tngl and Tng2 protein sequence are further characterized by a 
hydrophilicity analysis (Hopp, T. and Woods, K., 1981, Proc Natl Acad Sci USA, 
78:3824). A hydrophilicity profile is used to identify the hydrophobic and 
hydrophilic regions of the Tel -lb, Tngl or Tng2 protein and the corresponding 
regions of the gene sequence which encode such regions. 

Secondary structural analysis (Chou, P. and Fasman, G., 1974, 
Biochemistry, 13:222) is also done, to identify regions of the Tel- lb, Tngl or Tng2 
protein that assume specific secondary structures. 
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Manipulation, translation, and secondary structure prediction, as well as 
open reading frame prediction and plotting, is also accomplished using computer 
software programs available in the art. 

Other methods of structural analysis are also employed. These include, but 
are not limited to, X-ray crystallography (Engstom, A., 1974, Biochem Exp Biol, 
11:7-13) and computer modeling (Fletterick, R. and Zoller, M. (eds.), 1986, 
Computer Graphics and Molecular Modeling, in Current Communications in 
Molecular Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.). 

Uses of TCL-lb , TNG1 or TNG2 and its Tel- lb Tngl or Tng2, respectively, 
Protein Product and Antibodies Thereto 

Chromosomal translocations and inversions associated with the TCL-lb, 
TNG1 or TNG2 locus on chromosome 14, e.g., t(14:14)(qll;q32) chromosome 
translocation, inv(14)(qll;q32) chromosome inversion, and t(7:14)(q35:q32) 
chromosome translocation, are associated with several post-thymic types of T-cell 
leukemias, including, but not limited to, T-prolymphocytic leukemias (T-PLL) 
(Brito-Babapulle and Catovsky, 1991, Cancer Genet Cytogenet, 55:1-9), acute and 
chronic leukemias associated with the immunodeficiency syndrome ataxia- 
telangiectasia (AT) (Russo et al., 1988, Cell, 53:137-144; Russo et al., 1989, Proc. 
Natl. Acad. Sci. U.S.A. 86:602-606), and adult T-cell leukemia (Virgilio et al., 
1993, PNAS, 90:9275-9279). In some cases of AT-associated translocations, in T- 
cell leukemia and lymphoma involving the 14q32.1 band, clonal expansion of cells 
carrying abnormalities in 14q32.1 have been documented in some cases prior to the 
development of overt malignancy (Russo, et al., 1988 Cell,. 53:137-144). 
Therefore, a TCL-lb, TNG1 or TNG2 polynucleotide, its Tel -lb, Tngl or Tng2, 
respectively, protein product and antibodies thereto are used for diagnostic and/or 
therapeutic/prophylactic purposes for the above described diseases, as well as other 
disorders associated with chromosomal translocations and inversions associated with 
the TCL-lb , TNG1 or TNG2 gene locus and/or, increased expression of TCL-lb, 
TNG1 or TNG2 RNA or protein, respectively. A TCL-lb, TNG1 or TNG2 
polynucleotide, its encoded protein product and antibodies thereto are used for 
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therapeutic/prophylactic purposes alone or in combination with other therapeutics 
useful in the treatment of T-cell leukemias. Such molecules are also used in 
diagnostic assays, such as immunoassays, to detect, prognose, diagnose, or monitor 
various conditions, diseases, and disorders associated with TCL-lb , TNG1 or TNG2 

5 gene expression or monitor the treatment thereof. Accordingly, in specific 
embodiments, T-cell malignancies or premalignant changes in such tissues is 
diagnosed by detecting increased TCL-lb, TNG1 or TNG2 gene expression in patient 
samples relative to the level of TCL-lb , TNG1 or TNG2 gene expression in an 
analogous non-malignant sample (from the patient or another person, as determined 

10 experimentally or as is known as a standard level in such samples). For diagnostic 
purposes, a TCL-lb, TNG1 or TNG2 polynucleotide is used to detect TCL-lb , 
TNG1 or TNG2 gene, respectivley, expression or increased TCL-lb , TNG1 or 
TNG2 gene expression in disease states, such as, T-cell leukemias and lymphomas. 
For therapeutic purposes, a Tel -lb, Tngl or Tng2 protein is used to make anti- Tel 

15 lb, anti-Tngl or anti-Tng2 antibodies that neutralize the activity of Tel -lb, Tngl or 
Tng2, respectively. Included within the scope of the present invention are 
oligonucleotide sequences, that include antisense RNA and DNA molecules and 
ribozymes, that function to inhibit expression of a TCL-lb, TNG1 or TNG2 RNA 
or protein. 

20 

Diagnostic Uses 

The TCL-lb , TNG1 or TNG2 gene sequence is associated with disease states 
associated with chromosome 14 translocations and inversions around the TCL-lb, 

25 TNG1 or TNG2 gene locus, is preferentially expressed early in T and B lymphocyte 
differentiation and demonstrates a high level of expression in cells from patients 
diagnosed with T-PLL carrying an inversion of chromosome 14, inv(14)(qll;q32) 
or patients carrying a t(14:14)(qll;q32) chromosome translocation. Accordingly, 
TCL-lb , TNG1 or TNG2 gene sequences (SEQ. ID. NO: 40, 45, and 46, 

30 respectively) are used diagnostically for the detection of diseases states resulting 
from chromosomal abnormalities, e.g., translocations, inversions and deletions, 
involving the TCL-lb, TNG1 or TNG2 gene locus of chromosome 14. Nucleic acids 
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comprising TCL-lb f TNG1 or TNG2 nucleotide sequences of at least 8 nucleotides, 
at least 15 nucleotides, at least 25 nucleotides, at least 50 nucleotides, at least 100 
nucleotides, at least 200 nucleotides, at least 300 nucleotides, or at least 387 
nucleotides up to 1324 nucleotides of SEQ ID NO: 38, 41 and 43 cDNA, 
5 respectivley, are used as probes in hybridization assays for the detection and 
measurement of TCL-lb , TNG1 or TNG2 gene (SEQ. ID. NO: 40, 45, and 46, 
respectively). Nucleic acids of not more than 5 kilobases, of not more than 10 
kilobases, not more than 25 kilobases, not more than 50 kilobases or not more than 
70 kilobases which are hybridizable to a TCL-lb , TNG1 or TNG2 gene, cDNA, or 
10 complementary strand is used as probes in hybridization assays for the detection and 
measurement of TCL-lb, TNG1 or TNG2 nucleotide sequences. As an example, the 
TCL-lb , TNG1 or TNG2 DNA sequence is used in hybridization assays, e.g., 
^ Southern or Northern analysis, including in situ hybridization assays, of patient's 

Ul samples to diagnose abnormalities of TCL-lb , TNG1 or TNG2 gene expression, 

CJ1 15 respectively. Hybridization assays are used to detect, prognose, diagnose, or 

»t monitor conditions, disorders, or disease states, such as T-cell malignancies, 

associated with aberrant changes in TCL-lb , TNG1 or TNG2 expression and/or 
p activity as described supra. In particular, such a hybridization assay is carried out 

f : by a method comprising contacting a sample containing nucleic acid with a nucleic 

Ul 20 acid probe capable of hybridizing to TCL-lb , TNG1 or TNG2 DNA or RNA, under 

Q conditions such that hybridization can occur, and detecting or measuring any 

resulting hybridization. In particular, hybridization assays are used to detect the 
presence of abnormalities associated with increased expression of TCL-lb, TNG1 or 
TNG2 mRNA, by hybridizing mRNA or cDNA from a patient sample to a TCL-lb, 
25 TNG1 or TNG2, respectively, probe, and measuring the amount of resulting 
hybridization. For example, assays which are used include, but are not limited to 
Northern blots, Dot blots, reverse transcriptase PCR, etc. A preferred hybridization 
assay is Northern blot analysis of a patient sample using TCL-lb , TNG1 or TNG2 
gene probes of at least 15 polynucleotides up to the fall length cDNA sequence of 
30 each respective gene (SEQ. ID. NO: 38, 41 and 43, respectively). . Another 
preferred hybridization assay is in situ hybridization analysis of a patient sample 
using anti-Tcl-lb, anti-Tngl or anti-Tng2 antibodies or TCL-lb, TNG1 or TNG2 
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nucleotide hybridization probes. Such techniques are well known in the art, and are 
in fact the basis of many commercially available diagnostic kits. 

As used herein, patient samples which are used include, but are not limited 
to, fresh or frozen tissue samples, which are used in in situ hybridization assays; 
5 cell or tissue samples containing T-lymphocytes and, in general, patient samples 
containing nucleic acid, such as peripheral blood lymphocytes (PBL) and T- 
lymphocytes which are used in assays that measure or quantitate TCL-lb, TNG1 or 
TNG2 nucleic acid. 

Polynucleotide sequences of TCL-lb, TNG1 or TNG2 consisting of at least 

10 8 to 25 nucleotides that are useful as primers in primer dependent nucleic acid 
amplification methods are used for the detection of TCL-lb, TNG1 or TNG2, 
resepectively, gene sequences in patient samples. Primer dependent nucleic acid 
amplification methods useful in the present invention include, but are not limited to, 
polymerase chain reaction (PCR), competitive PCR, cyclic probe reaction, and 

15 ligase chain reaction. Such techniques are well known by those of skill in the art. A 
preferred nucleic acid amplification method of the present invention is reverse 
transcriptase PCR (RT-PCR) (Siebert, et al., 1992, Nature, 359:557-558). 

In a particular embodiment of the present invention, each primer of a pair of 
primers for use in a primer dependent nucleic aid amplification method is selected 

20 from a different exon of the genomic TCL-lb, TNG1 or TNG2 nucleotide sequences. 
For example, if one primer of a pair or primers is selected from exon 1 of the TCL- 
lb, TNG1 or TNG2 genomic sequence, the second primer will be selected from exon 
2, 3 or 4 of the TCL-lb or TNG2, respectively, or exon 2 of the TNG1 genomic 
sequence. As another example, if one primer of a pair of primers is selected from 

25 exon 2 of the TCL-lb or TNG2 genomic sequence, the second primer will be 
selected from exon 1, 3, or 4 of the TCL-lb TNG2 genomic sequence, respectively. 
By selecting each primer of a pair of primers for use in a primer dependent nucleic 
acid amplification method from a different exon, amplified genomic nucleotide 
sequences are distinguished from amplified cDNA nucleotide sequences due to the 

30 size difference of the resulting amplified sequences. Resulting amplified genomic 
nucleotide sequences will contain amplified intron sequences and will be of a larger 
size than amplified cDNA nucleotide sequences that will not contain amplified 
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intron sequences. For amplification of cDNA nucleotide sequences, the primer 
sequences should be selected from exons sequences that are sufficiently far enough 
apart to provide a detectable amplified nucleotide sequence. 

The TCL-lb, TNG1 or TNG2 gene sequences of the present invention (SEQ. 

5 ID. NO: 40, 45, and 46, respectively) are used diagnostically for the detection of 
chromosome 14 abnormalities, in particular translocations t(14:14)(qll:q32) and 
inv(14)(qll;q32) inversion at 14q32.1. Accordingly, the present invention provides 
a process for detecting a target sequence indicative of or including a chromosome 14 
abnormality in a sample, comprising the steps of amplifying the target sequence in 

10 the sample using a first primer of 8 to 25 nucleotides, preferably 18-25 nucleotides, 
complementary to the nucleotide sequence of SEQ ID NO: 40 {TCL-lb), 45 (TNG1) 
or 46 (TNG2) or SEQ ID NO: 38 (TCL-lb) , 41 (TNG1), or 44 (TNG2) and a second 
primer complementary to a region teleomeric or centromeric to the TCL-lb , TNG1 
or TNG2 gene, respectively, and detecting any resulting amplified target sequence in 

15 which the presence of the amplified target sequence is indicative of the abnormality. 
The present invention also provides a method of diagnosing a T-cell malignancy 
associated with chromosome 14 abnormalities in a patient by detecting a 
chromosome 14 abnormality according to the method above in which the presence 
of the amplified target sequence indicates the presence of a T-cell malignancy in the 

20 patient. The resultant amplified target sequence is detected on gel electrophoresis 
and compared with a normal sample or standard that does not contain a chromosome 
14 abnormality. Virgilio et al., supra, disclose polynucleotide sequences useful as 
second primers. Other polynucleotide sequences useful as second primers are 
selected from the T-cell receptor .a/8 locus, the T-cell receptor p. chain, or if the 

25 chromosome 14 abnormality involves aninversion, a polynucleotide sequence 5 ' to 
exon 1 of the TCL-lb , TNG1 or TNG2 gene, or if the chromosome abnormality 
involves a translocation, a polynucleotide sequence 3' to the 3' intron of the TCL- 
lb, TNG1 or TNG2 gene. The amplification of genomic DNA target sequences may 
require generating long PCR products. PCR techniques for generating long PCR 

30 products are described in Science (1994) 263:1564-1565; PCR kits for generating 
long PCR products are available from Perkin Elmer and Takara Shuzo Co., Ltd. 
The present invention also provides a method for detecting a target nucleotide 
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sequence indicative of or including at least a portion of a chromosome 14 
abnormality in a nucleic acid sample, comprising the steps of hybridizing the sample 
with a nucleic acid probe of not more than 10 kilobases, comprising in the range of 
15-1324 nucleotides complementary to at least a portion of the nucleotide sequence 

5 of SEQ ID NO: 40 (TCL-lb), 45 (77VG7) or 46 (TNG2) and detecting or measuring 
the amount of any resulting hybridization between the probe and the target sequence 
within the sample. The resultant hybridization between the probe and the target 
sequence within the sample is detected using gel electrophoresis and is compared to 
a target sequence from a normal sample or standard that does not contain a 

10 chromosome 14 abnormality. The present invention also provides a method of 
diagnosing a T-cell malignancy associated with chromosome 14 abnormalities in a 
patient comprising, detecting said chromosome 14 abnormality according to the 
method above in which the presence of the amplified target sequence indicates the 
presence of a T-cell malignancy in the patient. Absolute complementarity between a 

15 hybridization probe and a target sequence, although preferred, is not required. A 
sequence "complementary to at least a portion of", as referred to herein, means a 
sequence having sufficient complementarity to be able to hybridize with the nucleic 
acid, forming a stable hybridization complex. The ability to hybridize will depend 
on both the degree of complementarity and the length of the nucleic acid. Generally, 

20 the longer the hybridizing nucleic acid, the more base mismatches with a TCL-lb, 
TNG1 or TNG2 RNA it may contain and still form a stable duplex (or triplex, as 
the case is) . One skilled in the art can ascertain a tolerable degree of mismatch by 
use of standard procedures to determine the melting point of the hybridized 
complex. 

25 An additional aspect of the present invention relates to diagnostic kits for the 

detection or measurement of TCL-lb , TNG1 or TNG2 gene sequences and Tel- lb, 
Tngl or Tng2, respectively, protein. Accordingly, the present invention provides a 
diagnostic kit comprising, in a container a compound comprising a probe of not 
more than 10 kilobases and comprising in the range of 15-1324 nucleotides of the 

30 nucleotide sequence of SEQ ID NO: 38 (TCL-lb), 41 (TNG1) or 43 (TNG2) or its 
complement. Alternatively, the present invention provides a diagnostic kit 
comprising, in one or more containers, a pair of primers of at least 8-25 nucleotides 
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in which at least one of the primers is hybridizable to SEQ ID NO: 38 (TCL-lb), 41 
(TNG1) or 43 (TNG2) or its complement and wherein the primers are capable of 
priming cDNA synthesis in an amplification reaction. The present invention also 
provides a diagnostic kit in which at least one of the primers is hybridizable to SEQ 

5 ID NO: 38 (TCL-lb), 41 (TNG1) or 43 (TNG2) or its complement and in which one 
of the primers is hybridizable to a DNA sequence located telomeric or centromeric 
to the TCL-lb , TNG1 or TNG2 gene. In a specific embodiment, one of the 
foregoing compounds of the container is detectably labeled. 

The amplification reaction of the present invention are a polymerase chain 

10 reaction, competitive PCR and competitive reverse-transcriptase PCR (Clementi, et 
al., 1994, Genet Anal Tech Appl, 11(1): 1-6; Siebert et al., 1992, Nature, 359:557- 
558); cyclic probe reaction, which allows for amplification of a target sequence 
using a hybrid RNA/DNA probe and RNase (ID Biomedical); ligase chain reaction 
(Wu, et al., 1989, Genomics, 4:560-569). In a particular embodiment, the 

15 chromosomal abnormality associated with a TCL-lb f TNG1 or TNG2 locus is 
detected as described in PCT Publication No. WO/92/19775, dated Nov. 12, 1992. 
In a specific embodiment, the TCL-lb, TNG1 or TNG2 probe used in a 
hybridization assay is detectably labeled. Such a label is any known in the art 
including, but not limited to, radioactive labels, fluorescent labels, biotin, 

20 chemiluminescent labels, etc. 

In a specific embodiment in which the assay used employs primers, at least 
one primer is detectably labeled. In another embodiment, one of a primer pair is 
attached to a moiety providing for capture, e.g., a magnetic bead. 

Anti-Tcl-lb, anti-Tngl or anti-Tng2 antibodies are generated and used 

25 diagnostically to detect the presence of Tel -lb, Tngl or Tng2 protein product, 
respectively, in patient samples thereby identifying disease states associated with 
chromosome 14 abnormalities. For detection of Tel -lb, Tngl or Tng2 protein 
sequences (SEQ. ID. NO: 39, 42, or 44, respectively), a diagnostic kit of the 
present invention comprises, in one or more containers, an anti- Tel -lb , anti-Tngl 

30 or anti-Tng2 antibody which optionally is detectably labeled. In a different 
embodiment, the kit can comprise in a container, a labeled specific binding portion 
of an antibody. As used herein, the term detectable label refers to any label which 
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provides directly or indirectly a detectable signal and includes, for example, 
enzymes, radiolabeled molecules, fluorescent molecules, particles, 
chemiluminesors, enzyme substrates or cofactors, enzyme inhibitors, or magnetic 
particles. Examples of enzymes useful as detectable labels in the present invention 

5 include alkaline phosphatase and horse radish peroxidase. A variety of methods are 
available for linking the detectable labels to proteins of interest and includee, for 
example, the use of a bifunctional agent, such as, 4,4'-difluoro-3,3'-dinitro- 
phenylsulfone, for attaching an enzyme, for example, horse radish peroxidase, to a 
protein of interest. The attached enzyme is then allowed to react with a substrate 

10 yielding a reaction product which is detectable. The present invention provides a 
method for detecting a Tel -lb, Tngl or Tng2 protein in a patient sample, 
comprising, contacting the patient sample with an anti- Tel -lb, anti-Tngl or anti- 
Tng2 antibody, respectively, under conditions such that immunospecific binding 
occurs, and detecting or measuring the amount of any immunospecific binding by 

15 the antibody. 

Samples are any sample from a patient containing Tel -lb, Tngl or Tng2 
protein, e.g., tissue sections, peripheral blood lymphocytes, etc In diagnosing 
disease states, the functional activity of Tel -lb, Tngl or Tng2 proteins, derivatives 
and analogs are assayed by various methods. Accordingly, the present invention also 

20 provides a method of diagnosing a T-cell malignancy associated with chromosome 
14 abnormalities in a patient comprising, detecting increased expression of Tel -lb, 
Tngl or Tng2 protein in a sample from the patient, in which an increase in Tel -lb, 
Tngl or Tng2, respectivley, protein relative to the level found in such an analogous 
sample from a normal individual, indicates the presence of a T-cell malignancy in 

25 the patient. 

For example, in one embodiment, where one is detecting or measuring Tel - 
lb, Tngl or Tng2 protein by assaying for binding to anti- Tel -lb, anti-Tngl or 
anti-Tng2 antibody, respectively, various immunoassays known in the art are used, 
including, but not limited to, competitive and non-competitive assay systems using 
30 techniques such as radioimmunoassays, ELISA (enzyme linked immunosorbent 
assay), "sandwich" immunoassays, immunoradiometric assays, gel diffusion 
precipitin reactions, mmunodiffusion assays, in situ immunoassays (using colloidal 
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gold, enzyme or radioisotope abels, for example), western blots, in situ 
hybridizations, precipitation reactions, agglutination ssays (e.g., gel agglutination 
assays, hemagglutination assays), complement fixation assays, mmunofluorescence 
assays, protein A assays, and Immunoelectrophoresis assays, etc. In one 
5 mbodiment, antibody binding is detected by detecting a label on the primary 
antibody. In nother embodiment, the primary antibody is detected by detecting 
binding of a secondary antibody or reagent to the primary antibody. In a further 
embodiment, the secondary antibody is labelled. Many means are known in the art 
for detecting binding in an immunoassay and are within the scope of the present 
10 invention. In particular, such an immunoassay is carried out by a method 
comprising contacting a sample derived from a patient with an anti- Tel -lb, anti- 
Tngl or anti-Tng2 antibody under conditions such that immunospecific binding 
occurs, and detecting or measuring the amount of any immunospecific binding by 
the antibody. In a specific embodiment, antibody to a Tel -lb, Tngl or Tng2 protein 
15 is used to assay a patient tissue or serum sample for the presence of a Tel -lb, Tngl 
or Tng2 protein, respectively, where an increased level of Tel -lb, Tngl or Tng2 
protein is an indication of a diseased condition. In one embodiment of the present 
invention, the Tel -lb, Tngl or Tng2 protein is detected or measured by 
immunocytochemistry of a patient sample. In another embodiment, assays to 
20 measure the levels of Tel -lb, Tngl or Tng2 protein or RNA is used to moniter 
therapy of disease associated with increased expression of Tel -lb. For example, a 
decrease in levels of TCL-lb, TNG1 or TNG2 RNA or protein after therapy, relative 
to the level found before therapy, are indicative of a favorable response to therapy. 
An increase in such levels after therapy are indicative of a poor response to therapy. 
25 In another embodiment, the levels of Tel -lb, Tngl or Tng2 protein or RNA 

expression are used to stage disease, with an increase in Tel -lb, Tngl or Tng2 
protein or RNA, respectively, expression indicating disease progression. 

Other methods will be known to the skilled artisan and are within the scope 
of the invention. 



30 



Therapeutic/Prophylactic Uses 
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Inhibitors of Tel- lb, Tngl or Tng2 are used therapeutically for the treatment 
of disease states associated with chromosome 14 abnormalities, in particular at 
14q32.1, and/or increased expression of Tel -lb, Tngl or Tng2 protein, 
respectively. In an embodiment of the present invention, a Tel -lb, Tngl or Tng2 

5 protein and/or cell line that expresses a Tel -lb, Tngl or Tng2 protein, respectively, 
is used to screen for antibodies, peptides, or other molecules that bind to the Tel - 
lb, Tngl or Tng2 protein and thus may act as agonists or antagonists of Tel -lb, 
Tngl or Tng2 protein. For example, anti-Tel -lb, anti-Tngl or anti-Tng2 antibodies 
capable of neutralizing the activity of a Tel -lb, Tngl or Tng2 protein, respectively, 

10 are used to inhibit or prevent a disease state associated with chromosome 14 
abnormalities and/or expression of Tel -lb, Tngl or Tng2 protein, such as T-cell 
leukemia and lymphoma. Accordingly, the present invention provides a method for 
treating a disease state associated with a chromosome 14 abnormality in mammal 
suffering from a disease state associated with a chromosome 14 abnormality 

15 comprising, administering a therapeutically effective amount of an anti- Tel -lb, 
anti-Tngl or anti-Tng2 antibody to a mammal suffering from a disease state 
associated with a chromosome 14 abnormality. Alternatively, screening of organic 
or peptide libraries with recombinantly expressed Tel -lb, Tngl or Tng2 protein are 
useful for identification of therapeutic molecules that function to inhibit the activity 

20 of Tel -lb, Tngl or Tng2 protein, respectively. Synthetic and naturally occurring 
products are screened in a number of ways deemed routine to those of skill in the 
art. 

The ability of antibodies, peptides or other molecules to modulate the effect 
of Tel -lb, Tngl or Tng2 protein on disease states is monitored. For example, the 

25 expression of TCL-lb , TNG1 or TNG2 gene sequences (SEQ. ID. NO: 40, 45, or 
46, respectively) or Tel -lb, Tngl or Tng2 protein sequences (SEQ. ID. NO: 38, 
42, or 44, respectively) are detected as described, supra, both before and after 
administration of a therapeutic composition comprising a TCL-lb, TNG1 or TNG2 
nucleotide sequence, Tel -lb, Tngl or Tng2 protein sequence, derivative or analog 

30 thereof, or antibody thereto, respectivley, of the present invention. 

A TCL-lb, TNG1 or TNG2 polynucleotide is useful in the treatment of 
various disease states associated with chromosome 14 abnormalities, such as T-cell 
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leukemias and lymphomas, and/or increased expression of Tel- lb, Tngl or Tng2 
protein. By introducing TCL-lb, TNG1 or TNG2 antisense gene sequences into 
cells, gene therapy is used to treat conditions associated with over-expression of 
TCL-lb , TNG1 or TNG2 genes, respectively. Accordingly, the present invention 

5 provides a method for treating a disease state associated with a chromosome 14 
abnormality in mammal suffering from a disease state associated with a chromosome 
14 abnormality comprising, administering a therapeutically effective amount of a 
TCL-lb, TNG1 or TNG2 antisense molecule to a mammal suffering from a disease 
state associated with a chromosome 14 abnormality. 

10 Oligonucleotide sequences, that include antisense RNA and DNA molecules 

and ribozymes that function to inhibit the translation of a TCL-lb, TNG1 or TNG2 
mRNA are within the scope of the invention. "Antisense" as used herein refers to a 
nucleic acid capable of hybridizing to a portion of a TCL-lb, TNG1 or TNG2 RNA 
(preferably mRNA) by virtue of some sequence complementarity. Antisense RNA 

15 and DNA molecules act to directly block the translation of mRNA by binding to 
targeted mRNA and preventing protein translation. In regard to antisense DNA, 
oligodeoxy ribonucleotides derived from the translation initiation site, e.g., between 
-10 and +10 regions of a TCL-lb, TNG1 or TNG2 nucleotide sequence, are 
preferred. The present invention provides for an antisense molecule comprising a 

20 nucleotide sequence complementary to at least a part of the coding sequence of a 
Tel- lb, Tngl or Tng2 protein which is hybridizable to a TCL-lb, TNG1 or TNG2 
mRNA, respectively. The present invention also provides for an antisense molecule 
with a nucleotide sequence complementary to at least a part of the non-coding 
sequence (SEQ ID NO: 40, 45, or 46, respectively) which hybridizes to the TCL-lb, 

25 TNG1 or TNG2 coding sequence (SEQ ID NO: 40, 45, or 46, respectively). In a 
preferred embodiment of the present invention, the antisense gene sequence is 
derived from the 5' non-coding sequence of a TCL-lb, TNG1 or TNG2 gene. In a 
particularly preferred embodiment of the present invention, the antisense gene 
sequence is derived from TCL-lb, TNG1 or TNG2 gene (SEQ ID NO: 38, 41, or 

30 43, respectively). 

Ribozymes are enzymatic RNA molecules capable of catalyzing the specific 
cleavage of RNA. The mechanism of ribozyme action involves sequence specific 
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hybridization of the ribozyme molecule to complementary target RNA, followed by 
a endonucleolytic cleavage. Within the scope of the invention are engineered 
hammerhead motif ribozyme molecules that specifically and efficiently catalyze 
endonucleolytic cleavage of TCL-lb, TNG1 or TNG2 RNA sequences. 

5 Specific ribozyme cleavage sites within any potential RNA target are initially 

identified by scanning the target molecule for ribozyme cleavage sites which include 
the following sequences, GUA, GUU and GUC. Once identified, short RNA 
sequences of between 15 and 20 ribonucleotides corresponding to the region of the 
target gene containing the cleavage site are evaluated for predicted structural 

10 features, such as secondary structure that may render the oligonucleotide sequence 
unsuitable. The suitability of candidate targets may also be evaluated by testing their 
accessibility to hybridization with complementary oligonucleotides, using 

ribonuclease protection assays. 

Both anti-sense RNA and DNA molecules and ribozymes of the invention are 

15 prepared by any method known in the art for the synthesis of RNA molecules. 
These include techniques for chemically synthesizing oligodeoxyribonucleotides 
well known in the art such as for example solid phase phosphoramidite chemical 
synthesis. Alternatively, RNA molecules are generated by in vitro and in vivo 
transcription of DNA sequences encoding the antisense RNA molecule. Such DNA 

20 sequences are incorporated into a wide variety of vectors which incorporate suitable 
RNA polymerase promoters such as the T7 or SP6 polymerase promoters. 
Alternatively, antisense cDNA constructs that synthesize antisense RNA 
constitutively or inducibly, depending on the promoter used, is introduced stably 
into cell lines. 

25 Various modifications to the DNA molecules are introduced as a means of 

increasing intracellular stability and half-life. Examples of modifications include, 
but are not limited to, the addition of flanking sequences of ribo- or deoxy- 
nucleotides to the 5' and/or 3' ends of the molecule or the use of phosphorothioate 
or T O-methyl rather than phosphodiesterase linkages within the 

30 oligodeoxy ribonucleotide backbone. 

Methods for introducing nucleic acid into cells or tissue include methods for 
in vitro introduction of nucleic acid such as the insertion of naked nucleic acid, i.e., 
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by injection into tissue, the introduction of a nucleic acid in a cell ex vivo, the use of 
a vector such as a virus, retrovirus, phage or plasmic, etc. or techniques such as 
electroporation which are used in vivo or ex vivo. 

Other methods will be known to the skilled artisan and are within the scope 

5 of the invention. 



Demonstration of Therapeutic or Prophylactic Utility 

The TCL-lb, TNG1 or TNG2 polynucleotides, TcMb, Tngl or Tng2 
10 protein products, respectivley, derivatives and analogs thereof, and antibodies 
thereto, of the invention are tested in vivo for the desired therapeutic or prophylactic 
activity. For example, such compounds are tested in suitable animal model systems 
" prior to testing in humans, including but not limited to rats, mice, chicken, cows, 

U1 monkeys, rabbits, etc. For in vivo testing, prior to administration to humans, any 

m 15 animal model system known in the art are used. 

'£ 3.,; 

^ Therapeutic/Prophylactic Methods and Compositions 

[r: The invention provides methods of treatment and prophylaxis by 

20 administration to a subject of an effective amount of a Therapeutic, i.e., a TCL-lb, 
Q TNG1 or TNG2 polynucleotide, Tel- lb, Tngl or Tng2 protein, respectively, 

derivative or analog thereof, or antibody thereto of the present invention. In a 
preferred aspect, the Therapeutic is substantially purified. The subject is preferably 
an animal, including but not limited to animals such as cows, pigs, chickens, etc., 
25 and is preferably a mammal, and most preferably human. 

Various delivery systems are known and used to administer a Therapeutic of 
the invention, e.g., encapsulation in liposomes, microparticles, microcapsules, 
expression by recombinant cells, receptor-mediated endocytosis (see, e.g., Wu and 
Wu, 1987, / Biol Chem, 262:4429-4432), construction of a therapeutic nucleic acid 
30 as part of a retroviral or other vector, etc. Methods of introduction include but are 
not limited to intradermal, intramuscular, intraperitoneal, intravenous, 
subcutaneous, intranasal, and oral routes. The compounds are administered by any 



58 



PATENT CRO01 .NP003 

convenient route, for example by infusion or bolus injection, by absorption through 
epithelial or mucocutaneous linings (e.g., oral mucosa, rectal and intestinal mucosa, 
etc.) and are administered together with other biologically active agents. 
Administration is systemic or local. In addition, it are desirable to introduce the 
5 pharmaceutical compositions of the invention into the central nervous system by any 
suitable route, including intraventricular and intrathecal injection; intraventricular 
injection are facilitated by an intraventricular catheter, for example, attached to a 
reservoir, such as an Ommaya reservoir. 

In a specific embodiment, it are desirable to administer the pharmaceutical 
10 compositions of the invention locally to the area in need of treatment; this are 
achieved by, for example, and not by way of limitation, local infusion during 
surgery, topical application, e.g., in conjunction with a wound dressing after 
surgery, by injection, by means of a catheter, by means of a suppository, or by 
means of an implant, said implant being of a porous, non-porous, or gelatinous 
15 material, including membranes, such as sialastic membranes, or fibers. In one 
embodiment, administration is by direct injection at the site (or former site) of a 
malignant tumor or neoplastic or pre-neoplastic tissue. 

In a specific embodiment where the Therapeutic is a nucleic acid encoding a 
protein therapeutic, the nucleic acid is administered in vivo to promote expression of 
20 its encoded protein, by constructing it as part of an appropriate nucleic acid 
expression vector and administering it so that it becomes intracellular, e.g., by use 
of a retroviral vector (see U.S. Pat. No. 4,980,286), or by direct injection, or by 
use of microparticle bombardment (e.g., a gene gun; Biolistic, Dupont), or coating 
with lipids or cell-surface receptors or transfecting agents, or by administering it in 
25 linkage to a homeobox-like peptide which is known to enter the nucleus (see e.g., 
Joliot, et al., 1991, Proc Natl Acad Sci USA, 88:1864-1868), etc. Alternatively, a 
nucleic acid therapeutic is introduced intracellularly and incorporated within host 
cell DNA for expression, by homologous recombination. 

The present invention also provides pharmaceutical compositions. Such 
30 compositions comprise a therapeutically effective amount of a therapeutic, and a 
pharmaceutical^ acceptable carrier or excipient. Such a carrier includes but is not 
limited to saline, buffered saline, dextrose, water, glycerol, ethanol, and 
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combinations thereof. The carrier and composition are sterile. The formulation 
should suit the mode of administration. 

The composition, if desired, can also contain minor amounts of wetting or 
emulsifying agents, or pH buffering agents. The composition is a liquid solution, 

5 suspension, emulsion, tablet, pill, capsule, sustained release formulation, or 
powder. The composition is formulated as a suppository, with traditional binders 
and carriers such as triglycerides. Oral formulation includes standard carriers such 
as pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, sodium 
saccharine, cellulose, magnesium carbonate, etc. 

10 In a preferred embodiment, the composition is formulated in accordance with 

routine procedures as a pharmaceutical composition adapted for intravenous 
administration to human beings. Typically, compositions for intravenous 
administration are solutions in sterile isotonic aqueous buffer. Where necessary, the 
composition may also include a solubilizing agent and a local anesthetic such as 

15 lignocaine to ease pain at the site of the injection. Generally, the ingredients are 
supplied either separately or mixed together in unit dosage form, for example, as a 
dry lyophilized powder or water free concentrate in a hermetically sealed container 
such as an ampoule or sachette indicating the quantity of active agent. Where the 
composition is to be administered by infusion, it is dispensed with an infusion bottle 

20 containing sterile pharmaceutical grade water or saline. Where the composition is 
administered by injection, an ampoule of sterile water for injection or saline is 
provided so that the ingredients are mixed prior to administration. 

The Therapeutics of the invention are formulated as neutral or salt forms. 
Pharmaceutical^ acceptable salts include those formed with free amino groups such 

25 as those derived from hydrochloric, phosphoric, acetic, oxalic, tartaric acids, etc., 
and those formed with free carboxyl groups such as those derived from sodium, 
potassium, ammonium, calcium, ferric hydroxides, isopropylamine, triethylamine, 
2-ethylamino ethanol, histidine, procaine, etc. 

The amount of the Therapeutic of the invention which will be effective in the 

30 treatment of a particular disorder or condition will depend on the nature of the 
disorder or condition, and can be determined by standard clinical techniques. In 
addition, in vitro assays are employed to help identify optimal dosage ranges. The 
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precise dose to be employed in the formulation will also depend on the route of 
administration, and the seriousness of the disease or disorder, and should be decided 
according to the judgment of the practitioner and each patient's circumstances. 
However, suitable dosage ranges for intravenous administration are generally about 

5 20-500 micrograms of active compound per kilogram body weight. Suitable dosage 
ranges for intranasal administration are generally about 0.01 pg/kg body weight to 1 
mg/kg body weight. Effective doses are extrapolated from dose-response curves 
derived from in vitro or animal model test systems. 

Suppositories generally contain active ingredient in the range of 0.5% to 10k 

10 by weight; oral formulations preferably contain 10% to 95% active ingredient. 

The invention also provides a pharmaceutical pack or kit comprising one or 
more containers filled with one or more of the ingredients of the pharmaceutical 
compositions of the invention. Optionally associated with such container(s) is a 
notice in the form prescribed by a governmental agency regulating the manufacture, 

15 use or sale of pharmaceuticals or biological products, which notice reflects approval 
by the agency of manufacture, use or sale for human administration. 

Antisense Regulation of TCL-lb y TNG1 and TNG2 Gene Expression 

20 The present invention provides the therapeutic or prophylactic use of nucleic 

acids of at least six nucleotides that are antisense to a TCL-lb, TNG1 or TNG2 gene 
(SEQ. ID. NO: 40, 45, or 46, respectively) or TCL-lb, TNG1 or TNG2 cDNA 
(SEQ. ID. NO: 38, 41, or 43, respectively) encoding Tel- lb, Tngl or Tng2 (SEQ. 
ID. NO: 39, 42, or 44), respectively, or a portion thereof. Such antisense nucleic 

25 acids have utility as Antagonist Therapeutics of the invention, and is used in the 
treatment or prevention of disorders, e.g., T-cell malignancies as described supra. 

The antisense nucleic acids of the invention are oligonucleotides that are 
double-stranded or single-stranded, RNA or DNA or a modification or derivative 
thereof, which can be directly administered to a cell, or which are produced 

30 intracellular^ by transcription of exogenous, introduced sequences. 

In a specific embodiment, the TCL-lb, TNG1 or TNG2 antisense 
polynucleotides provided by the instant invention can be used for the treatment of 
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disease states associated with chromosome 14 abnormalities, in particular at 
14q32.1, wherein the disease state can be demonstrated (in vitro or in vivo) to 
express the TCL-lb 9 TNG1 or TNG2 gene, respectively. Such demonstration can be 
by detection of TCL-lb, TNG1 or TNG2 RNA or of Tel- lb, Tngl or Tng2 protein, 
5 respectively . 

The invention further provides pharmaceutical compositions comprising an 
effective amount of the TCL-lb, TNG1 or TNG2 antisense nucleic acids of the 
invention in a pharmaceutical^ acceptable carrier, as described supra. Methods for 
treatment and prevention of disease states associated with chromosome 14, such as 

10 T-cell malignancies comprising administering the pharmaceutical compositions of 
the invention are also provided. 

In another embodiment, the invention is directed to methods for inhibiting 
the expression of a TCL-lb, TNG1 or TNG2 nucleic acid sequence in a prokaryotic 
or eukaryotic cell comprising providing the cell with an effective amount of a 

15 composition comprising an antisense TCL-lb, TNG1 or TNG2 nucleic acid, 
respectively, of the invention. 

The TCL-lb, TNG1 or TNG2 antisense polynucleotides are of at least six 
nucleotides and are preferably oligonucleotides (ranging from 6 to about 50 
oligonucleotides). In specific aspects, the oligonucleotide is at least 10 nucleotides, 

20 at least 20 nucleotides, at least 30 nucleotides, or at least 40 nucleotides. The 
oligonucleotides are DNA or RNA or chimeric mixtures or derivatives or modified 
versions thereof, single-stranded or double-stranded. The oligonucleotide is 
modified at the base moiety, sugar moiety, or phosphate backbone. The 
oligonucleotide may include other appending groups such as peptides, or agents 

25 facilitating transport across the cell membrane (see, e.g., Letsinger, et al., 1989, 
Proc Natl Acad Sci USA, 86:6553-6556; Lemaitre, et al., 1987, Proc Natl Acad Sci 
USA, 84:648-652; PCT Publication No. WO 88/09810, published Dec. 15, 1988) or 
blood-brain barrier (see, e.g., PCT Publication No. WO 89/10134, published Apr. 
25, 1988), hybridization-triggered cleavage agents (see, e.g., Krol, et al., 1988, 

30 BioTechniques , 6:958-976) or intercalating agents (see, e.g., Zon, 1988, Pharm Res 
5:539-549). 
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The oligonucleotide are conjugated to another molecule, e.g., a peptide, 
hybridization triggered cross-linking agent, transport agent, hybridization-triggered 

cleavage agent, etc. 

Oligonucleotides of the invention are synthesized by standard methods 

5 known in the art, e.g., by use of an automated DNA synthesizer (such as are 
commercially available from Biosearch, Applied Biosystems, etc.). As examples, 
phosphorofhioate oligos are synthesized by the method of Stein et al. (1988, Nucl. 
Acids Res. 16:3209), mefhylphosphonate oligos are prepared by use of controlled 
pore glass polymer supports (Sarin, et al., 1988, Proc Natl Acad Sci USA, 85:7448- 

10 7451), etc. 

In a specific embodiment, the TCL-lb, TNG1 or TNG2 antisense 
oligonucleotide comprises catalytic RNA, or a ribozyme (see, e.g., PCT 
International Publication WO 90/11364, published Oct. 4, 1990; Sarver, et al., 
1990, Science, 247:1222-1225). In another embodiment, the oligonucleotide is a 2'- 

15 O-methylribonucleotide (Inoue, et al., 1987, Nucl Acids Res, 15:6131-6148), or a 
chimeric RNA-DNA analogue (Inoue, et al., 1987, FEBS Lett, 215:327-330). 

In an alternative embodiment, the TCL-lb, TNG1 or TNG2 antisense 
nucleic acid of the invention is produced intracellularly by transcription from an 
exogenous sequence. For example, a vector is introduced in vivo such that it is taken 

20 up by a cell, within which cell the vector or a portion thereof is transcribed, 
producing an antisense nucleic acid (RNA) of the invention. Such a vector would 
contain a sequence encoding the TCL-lb, TNG1 or TNG2 antisense nucleic acid, 
respectively. Such a vector can remain episomal or become chromosomally 
integrated, as long as it is transcribed to produce the desired antisense RNA. Such 

25 vectors are constructed by recombinant DNA technology methods standard in the 
art. Vectors are plasmid, viral, or others known in the art, used for replication and 
expression in mammalian cells. Expression of the sequence encoding the TCL-lb, 
TNG1 or TNG2 antisense RNA is by any promoter known in the art to act in 
mammalian, preferably human, cells. Such promoters are inducible or constitutive. 

30 Such promoters include but are not limited to: the SV40 early promoter region 
(Bernoist and Chambon, 1981, Nature, 290:304-310), the promoter contained in the 
3' long terminal repeat of Rous sarcoma virus (Yamamoto, et al., 1980, Cell, 
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22:787-797), the herpes thymidine kinase promoter (Wagner, et al., 1981, Proc 
Natl Acad Sci USA, 78:1441-1445), the regulatory sequences of the metallothionein 
gene (Brinster, et al., 1982, Nature, 296:3942), etc. 

The antisense nucleic acids of the invention comprise a sequence 
complementary to at least a portion of an RNA transcript of a TCL-lb, TNG1 or 
TNG2 gene, preferably a human TCL-lb, TNG1 or TNG2 gene (SEQ. ID. NO: 40, 
45, or 46, respectively). However, absolute complementarity, although preferred, is 
not required. A sequence "complementary to at least a portion of an RNA," as 
referred to herein, means a sequence having sufficient complementarity to be able to 
hybridize with the RNA, forming a stable duplex; in the case of double-stranded 
TCL-lb, TNG1 or TNG2 antisense nucleic acids, a single strand of the duplex DNA 
may thus be tested, or triplex formation are assayed. The ability to hybridize will 
depend on both the degree of complementarity and the length of the antisense 
nucleic acid. Generally, the longer the hybridizing nucleic acid, the more base 
mismatches with a TCL-lb, TNG1 or TNG2 RNA it may contain and still form a 
stable duplex (or triplex, as the case are). One skilled in the art can ascertain a 
tolerable degree of mismatch by use of standard procedures to determine the melting 
point of the hybridized complex. 

The TCL-lb antisense nucleic acids are used to treat (or prevent) T-cell 
malignancies, of a cell type which has been shown to express TCL-lb, TNG1 or 
TNG2 RNA. Malignant, neoplastic, and pre-neoplastic cells which are tested for 
such expression include, but are not limited, to those described supra. In a preferred 
embodiment, a single-stranded DNA antisense TCL-lb, TNG1 or TNG2 
oligonucleotide is used, respectively. 

Malignant (particularly, tumor) cell types which express TCL-lb, TNG1 or 
TNG2 RNA is identified by various methods known in the art. Such methods include 
but are not limited to hybridization with a TCL-lb, TNG1 or TNG2 -specific 
nucleic acid (e.g., by Northern hybridization, dot blot hybridization, in situ 
hybridization), observing the ability of RNA from the cell type to be translated in 
vitro into Tel- lb, Tngl or Tng2, respectvively. In a preferred aspect, primary 
tumor tissue from a patient is assayed for TCL-lb, TNG1 or TNG2 expression prior 
to treatment. 
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Pharmaceutical compositions of the invention, comprising an effective 
amount of a TCL-lb, TNG1 or TNG2 antisense nucleic acid in a pharmaceutical^ 
acceptable carrier, is administered to a patient having a malignancy which is of a 
type that expresses TCL-lb, TNG1 or TNG2 RNA. 

The amount of TCL-lb, TNG1 or TNG2 antisense nucleic acid which will 
be effective in the treatment of a particular disease state or condition will depend on 
the nature of the disease state or condition, and is determined by standard clinical 
techniques. Where possible, it is desirable to determine the antisense cytotoxicity of 
the tumor type to be treated in vitro, and then in useful animal model systems prior 

to testing and use in humans. 

In a specific embodiment, pharmaceutical compositions comprising TCL-lb, 
TNG1 or TNG2 antisense nucleic acids are administered via liposomes, 
microparticles, or microcapsules. In various embodiments of the invention, it are 
useful to use such compositions to achieve sustained release of the TCL-lb, TNG1 
or TNG2 antisense nucleic acids. In a specific embodiment, it are desirable to utilize 
liposomes targeted via antibodies to specific identifiable tumor antigens (Leonetti, et 
al., 1990, Proc Natl Acad Sci USA, 87:2448-2451; Renneisen, et al., 1990, J Biol 
Chem, 265:16337-16342). 
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CLAIMS 

What is claimed is: 

1. An isolated nucleic acid comprising a nucleotide sequence 
encoding a Tel- lb protein, wherein said nucleotide sequence is a 
cDNA sequence. 

2. The isolated nucleic acid of claim 1, wherein said nucleotide 
sequence encodes a human Tel- lb protein having an amino acid 
sequence of SEQ ID NO: 39 from amino acid number 1 to 128. 

3. An isolated nucleic acid of not more than 50 kilobases which 
contains at least an 18 nucleotide portion encoding a Tel- lb protein 
fragment. 

4. An isolated nucleic acid of not more than 50 kilobases which 
contains at least an 18 nucleotide portion of the sequence depicted in 
SEQ ID NO: 40. 

5. The isolated nucleic acid of claim 1, comprising a nucleotide 
sequence of SEQ ID NO: 38 from nucleotide number lto 1152. 

6. A Tel- lb protein. 

7. The isolated Tcl-lb protein of claim 6, comprising an amino acid 
sequence of SEQ. ID. NO: 39 from amino acid 1-128. 

8. An isolated nucleic acid, comprising a sequence encoding a 
fragment of a protein having an amino sequence of SEQ ID NO: 39 
from amino acid number 1 to 128, which fragment can be specifically 
bound by an antibody to a Tcl-lb protein. 
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9. A recombinant DNA vector, comprising a nucleotide sequence that 
encodes a Tel- lb protein, wherein said nucleotide sequence is a 
cDNA sequence. 

10. A host cell that contains said recombinant DNA vector of claim 
7. 

11. The recombinant DNA vector of claim 7, wherein the nucleotide 
sequence encodes a human Tcl-lb protein having an amino acid 
sequence of SEQ ID NO:39 from amino acid number 1 to 128. 

12. An isolated nucleic acid of not more than 50 kilobases which 
contains at least a 50 nucleotide portion of SEQ ID NO: 40. 

13. An isolated nucleic acid that is capable of hybridizing under 
stringent conditions to a nucleotide sequence that is complementary to 
the cDNA sequence of SEQ ID NO: 38, said nucleic acid containing 
at least an 25 nucleotide portion of SEQ ID NO: 38. 

14 An isolated nucleic acid that is capable of hybridizing under 
stringent conditions to a nucleotide sequence that is complementary to 
a cDNA sequence that encodes a Tcl-lb protein, which protein has an 
amino acid sequence of SEQ ID NO: 39, and said nucleic acid 
containing at least an 25 nucleotide portion of SEQ ID NO: 38. 

15. An antisense molecule, comprising a nucleotide sequence 
complementary to at least a part of a coding sequence of a Tcl-lb 
protein, which is hybridizable to a Tcl-lb mRNA. 
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16. The antisense molecule of claim 15, wherein said nucleotide 
sequence is complementary to a least a part of the sequence depicted 
inSEQ. ID. NO: 38. 

17. A fusion protein comprising a Tel- lb protein sequence of at least 
10 amino acids linked to a non-Tcl-lb protein sequence. 

18. An antibody which binds to an epitope of a TcMb protien. 

19. An isolated protein comprising an amino acid sequence having at 
least 70% amino acid sequence identity to an amino acid sequence 
depicted in SEQ. ID. NO: 39, over a contiguous sequence of at least 
25 amino acids. 

20. A method for detecting a target sequence indicative of a 
chromosome 14 abnormality in a sample, comprising the steps of: 

a) amplifying said target sequence in said sample using a first 
primer of 18 to 25 nucleotides complementary to a TCL- 
lb nucleotide sequence of SEQ. ID. NO: 38, and a 
sencond primer complementary to a region telomeric or 
centromeric, preferably from a T-cell receptor a/8 locus, 
to said Tel- lb gene; and 

b) detecting any resulting amplified target sequence in which 
the presence of said amplified target sequence is indicative 
of said chromosome 14 abnormality. 

21. The method of claim 20, wherein said chromosome 14 
abnormality is in a Tcl-lb locus and comprises a t(14:14)(qll:q32) 
translocation or an inv (14)(qll:q32) inversion. 
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22 A host cell that contains a recombinant vector comprising a cDNA 
sequence that encodes a human Tel- lb protein having the amino acid 
sequence of SEQ ID NO: 39 from amino acid number 1 to 128. 

5 23. A host cell that contains a recombinant vector comprising a 

nucleic acid that is capable of hybridizing under stringent conditions 
to a nucleotide sequence that is complementary to a cDNA sequence 
that encodes a Tel- lb protein, which protein has the amino acid 
sequence of SEQ ID NO:39, and said nucleic acid containing at least 

10 an 25 nucleotide portion of SEQ ID NO : 3 8 . 



2^ JS. A pharmaceutical composition, comprising said antisense 
molecule of claim 15 or 16 in a pharmaceutical^ acceptable carrier. 

15 ^ > 26. A pharmaceutical composition, comprising said antibody of claim 

18 in a pharmaceutically acceptable carrier. 

27. A method for detecting a target nucleotide sequence indicative of 
a chromosome 14 abnormality in a nucleic acid sample, comprising 
20 the steps of: 

a) hybridizing said sample with a nucleic acid probe of not 
more than 10 kilobases, comprising in the range of 15- 
1152 nucleotides complementary to said nucleotide 
sequence of SEQ. ID. NO: 38; and 
25 b) detecting or measuring an amount of any resulting 

hybridization between said probe and said target sequence 
within said sample. 
^ 28. The method of claim 27, wherein said chromosome 14 
abnormality is in a Tel- lb locus and comprises a t(14:14)(qll:q32) 
30 translocation or an inv (14)(qll:q32) inversion. 
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2&. A method for detecting a Tcl-lb protein in a patient sample, 
preferably a human sample, comprising: 

a) contacting said patient sample with an anti-Tcl-lb antibody 
under conditions such that immunospecific binding occurs; 
and 

b) detecting or measuring an amount of any immunospecific 
binding by said antibody. 



2/j • 30, A diagnostic kit, comprising in one or more containers, a pair of 
primers, each having at least 15-25 nucleotides, in which at least one 
of said primers is hybridizable to SEQ. ID. NO: 38 or it complement 
and wherein said primers are capable of priming DNA synthesis in a 
nucleic acid amplification reaction. 



31. A method for treating a disease state associated with a 
chromosome 14 abnormality in a mammal, preferably a human, 
suffering from said disease state associated with said chromosome 14 
abnormality, comprising administering a therapeutically effective 
amount of a Tcl-lb antisense molecule or an anti-Tcl-lb antibody to 
said mammal. 



£ ? , 32. The method of claim 31, wherein said disease state comprises a 
T-cell leukemia or lymphoma and said chromosome 14 abnormality 
comprises a t(14:14)(qll:q32) translocation or an inv (14)(qll:q32) 
inversion. 



7^ 33. An isolated nucleic acid comprising a nucleotide sequence 
encoding a Tngl protein, wherein said nucleotide sequence is a 
cDNA sequence. 
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35 34. The isolated nucleic acid of claim 33, wherein said nucleotide 
sequence encodes a human Tngl protein having an amino acid 
sequence of SEQ ID NO:42 from amino acid number 1 to 141 

$^ J6. An isolated nucleic acid of not more than 50 kilobases which 
contains at least an 18 nucleotide portion encoding a Tngl protein 
fragment. 

An isolated nucleic acid of not more than 50 kilobases which 
contains at least an 18 nucleotide portion of the sequence depicted in 
SEQ ID NO: 45. 



3b 30. The isolated nucleic acid of claim 33, comprising a nucleotide 
sequence of SEQ ID NO:41 from nucleotide number lto 1500. 



^ -38. A Tngl protein. 

34 39. The isolated Tngl protein of claim 38, comprising an amino acid 
sequence of SEQ. ID. NO: 42 from amino acid 1-141. 

c J 4p. An isolated nucleic acid, comprising a sequence encoding a 
fragment of a protein having an amino sequence of SEO ID NO: 42 
from amino acid number 1 to 141, which fragment can be specifically 
bound by an antibody to a Tngl protein. 

T #1. A recombinant DNA vector, comprising a nucleotide sequence 
that encodes a Tngl protein, wherein said nucleotide sequence is a 
cDNA sequence. 

42. A host cell that contains said recombinant DNA vector of claim 
39. 
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^ #3. The recombinant DNA vector of claim 39, wherein the nucleotide 
sequence encodes a human Tngl protein having an amino acid 
sequence of SEQ ID NO:42 from amino acid number 1 to 141. 

^4. An isolated nucleic acid of not more than 50 kilobases which 
contains at least a 50 nucleotide portion of SEQ ID NO: 45. 

U H 45. An isolated nucleic acid that is capable of hybridizing under 
stringent conditions to a nucleotide sequence that is complementary to 
the cDNA sequence of SEQ ID NO:41, said nucleic acid containing 
at least an 25 nucleotide portion of SEQ ID NO:41 . 



v)f' 46 An isolated nucleic acid that is capable of hybridizing under 
stringent conditions to a nucleotide sequence that is complementary to 
a cDNA sequence that encodes a Tngl protein, which protein has an 
amino acid sequence of SEQ. ID. NO: 42, and said nucleic acid 
containing at least an 25 nucleotide portion of SEQ. ID. NO: 41. 

4 & 47". An antisense molecule, comprising a nucleotide sequence 
complementary to at least a part of a coding sequence of a Tngl 
protein, which is hybridizable to a Tngl mRNA. 

^ 7 48, The antisense molecule of claim 47, wherein said nucleotide 
sequence is complementary to a least a part of the sequence depicted 
in SEQ. ID. NO:41. 



49. A fusion protein comprising a Tngl protein sequence of at least 
10 amino acids linked to a non- Tngl protein sequence. 

^"5- An antibody which binds to an epitope of a Tngl protien. 
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fa $1. An isolated protein comprising an amino acid sequence having at 
least 70% amino acid sequence identity to an amino acid sequence 
depicted in SEQ. ID. NO: 42, over a contiguous sequence of at least 
25 amino acids. 

^1 52. A method for detecting a target sequence indicative of a 
chromosome 14 abnormality in a sample, comprising the steps of: 

a) amplifying said target sequence in said sample using a first 
primer of 18 to 25 nucleotides complementary to a TNG1 
nucleotide sequence of SEQ. ID. NO: 41, and a second 
primer complementary to a region telomeric or 
centromeric, preferably from a T-cell receptor a/5 locus, 
to said Tngl gene; and 

b) detecting any resulting amplified target sequence in which 
the presence of said amplified target sequence is indicative 
of said chromosome 14 abnormality. 

fa 53. The method of claim 52, wherein said chromosome 14 
abnormality is in a Tngl locus and comprises a t(14:14)(qll:q32) 
translocation or an inv (14)(qll:q32) inversion. 

. 54 A host cell that contains a recombinant vector comprising a cDNA 
sequence that encodes a human Tngl protein having the amino acid 
sequence of SEQ. ID. NO: 42 from amino acid number 1 to 141. 

^ S$. A host cell that contains a recombinant vector comprising a 
nucleic acid that is capable of hybridizing under stringent conditions 
to a nucleotide sequence that is complementary to a cDNA sequence 
that encodes a Tngl protein, which protein has the amino acid 
sequence of SEQ. ID. NO: 42, and said nucleic acid containing at 
least an 25 nucleotide portion of SEQ. ID. NO: 41. 
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0*$6* A pharmaceutical composition, comprising said antisense 
molecule of claim 47 or 48 in a pharmaceutical^ acceptable carrier. 

<5%*- A pharmaceutical composition, comprising said antibody of claim 
50 in a pharmaceutical^ acceptable carrier. 

ffi J8. A method for detecting a target nucleotide sequence indicative of 
a chromosome 14 abnormality in a nucleic acid sample, comprising 
the steps of: 

a) hybridizing said sample with a nucleic acid probe of not 
more than 10 kilobases, comprising in the range of 15- 
1500 nucleotides complementary to said nucleotide 
sequence of SEQ. ID. NO: 41; and 

b) detecting or measuring an amount of any resulting 
hybridization between said probe and said target sequence 
within said sample. 

0* The method of claim 58, wherein said chromosome 14 
abnormality is in a Tngl locus and comprises a t(14:14)(qll:q32) 
translocation or an inv (14)(qll:q32) inversion. 



A method for detecting a Tngl protein in a patient sample, 
preferably a human sample, comprising: 

a) contacting said patient sample with an anti- Tngl antibody 
under conditions such that immunospecific binding occurs; 
and 

b) detecting or measuring an amount of any immunospecific 
binding by said antibody. 

Qt> A diagnostic kit, comprising in one or more containers, a pair of 
primers, each having at least 15-25 nucleotides, in which at least one 
of said primers is hybridizable to SEQ. ID. NO: 41 or it complement 
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and wherein said primers are capable of priming DNA synthesis in a 
nucleic acid amplification reaction. 

£>| £2. A method for treating a disease state associated with a 
chromosome 14 abnormality in a mammal, preferably a human, 
suffering from said disease state associated with said chromosome 14 
abnormality, comprising administering a therapeutically effective 
amount of a Tngl antisense molecule or an anti- Tngl antibody to 
said mammal. 

^ 68. The method of claim 62, wherein said disease state comprises a 
T-cell leukemia or lymphoma and said chromosome 14 abnormality 
comprises a t(14:14)(qll:q32) translocation or an inv (14)(qll:q32) 
inversion. 

y$ 64. An isolated nucleic acid comprising a nucleotide sequence 
encoding a Tng2 protein, wherein said nucleotide sequence is a 
cDNA sequence. 

{ft -65. The isolated nucleic acid of claim 64, wherein said nucleotide 
sequence encodes a human Tng2 protein having an amino acid 
sequence of SEQ. ID. NO: 44 from amino acid number 1 to 110. 

4jT 66. An isolated nucleic acid of not more than 50 kilobases which 
contains at least an 18 nucleotide portion encoding a Tng2 protein 
fragment. 

4> An isolated nucleic acid of not more than 50 kilobases which 
contains at least an 18 nucleotide portion of the sequence depicted in 
SEQ. ID. NO: 46. 
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The isolated nucleic acid of claim 64, comprising a nucleotide 
sequence of SEQ ID NO: 43 from nucleotide number lto XXX. 

gf A Tng2 protein. 

4?, 76C The isolated Tng2 protein of claim 69, comprising an amino acid 
sequence of SEQ. ID. NO: 44 from amino acid 1-110. 

71. An isolated nucleic acid, comprising a sequence encoding a 
fragment of a protein having an amino sequence of SEQ. ID. NO: 44 
from amino acid number 1 to 110, which fragment can be specifically 
bound by an antibody to a Tng2 protein. 

J72. A recombinant DNA vector, comprising a nucleotide sequence 
that encodes a Tng2 protein, wherein said nucleotide sequence is a 
cDNA sequence. 

1$. A host cell that contains said recombinant DNA vector of claim 
70. 

7? 74. The recombinant DNA vector of claim 70, wherein the nucleotide 
sequence encodes a human Tng2 protein having an amino acid 
sequence of SEQ ID NO: 44 from amino acid number 1 to 110. 

7 iJ 75. An isolated nucleic acid of not more than 50 kilobases which 
contains at least a 50 nucleotide portion of SEQ ID NO: 46. 

7;T 76. An isolated nucleic acid that is capable of hybridizing under 
stringent conditions to a nucleotide sequence that is complementary to 
the cDNA sequence of SEQ ID NO: 43, said nucleic acid containing 
at least an 25 nucleotide portion of SEQ ID NO: 43. 
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An isolated nucleic acid that is capable of hybridizing under 
stringent conditions to a nucleotide sequence that is complementary to 
a cDNA sequence that encodes a Tng2 protein, which protein has an 
amino acid sequence of SEQ ID NO: 44, and said nucleic acid 
containing at least an 25 nucleotide portion of SEQ ID NO: 43. 

41^8. An antisense molecule, comprising a nucleotide sequence 
complementary to at least a part of a coding sequence of a Tng2 
protein, which is hybridizable to a Tng2 mRNA. 

s$%9. The antisense molecule of claim 78, wherein said nucleotide 
sequence is complementary to a least a part of the sequence depicted 
in SEQ. ID. NO: 43. 

15 fi $6. A fusion protein comprising a Tng2 protein sequence of at least 

10 amino acids linked to a non- Tng2 protein sequence. 

w §1. An antibody which binds to an epitope of a Tng2 protien. 

20 j\ 82. An isolated protein comprising an amino acid sequence having at 

least 70% amino acid sequence identity to an amino acid sequence 
depicted in SEQ. ID. NO: 44, over a contiguous sequence of at least 
25 amino acids. 



25 TJ^' A method for detecting a target sequence indicative of a 

chromosome 14 abnormality in a sample, comprising the steps of: 

a) amplifying said target sequence in said sample using a first 
primer of 18 to 25 nucleotides complementary to a 
TNG2nucleotide sequence of SEQ. ID. NO: 43, and a 
30 sencond primer complementary to a region telomeric or 

centromeric, preferably from a T-cell receptor a/8 locus, 
to said Tng2 gene; and 
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b) detecting any resulting amplified target sequence in which 
the presence of said amplified target sequence is indicative 
of said chromosome 14 abnormality. 



5 f?. Si. The method of claim 83, wherein said chromosome 14 

abnormality is in a Tng2 locus and comprises a t(14:14)(qll:q32) 
translocation or an inv (14)(qll:q32) inversion. 

f Y 85*. A host cell that contains a recombinant vector comprising a 
10 cDNA sequence that encodes a human Tng2 protein having the amino 

acid sequence of SEQ ID NO: 44 from amino acid number 1 to 110. 



ff'fft. A host cell that contains a recombinant vector comprising a 
nucleic acid that is capable of hybridizing under stringent conditions 
to a nucleotide sequence that is complementary to a cDNA sequence 
that encodes a Tng2 protein, which protein has the amino acid 
sequence of SEQ ID NO: 44, and said nucleic acid containing at least 
an 25 nucleotide portion of SEQ ID NO: 43. 

fi> 87. A pharmaceutical composition, comprising said antisense 
molecule of claim 78 or 79 in a pharmaceutical^ acceptable carrier. 

97 A pharmaceutical composition, comprising said antibody of claim 
80 in a pharmaceutical^ acceptable carrier. 

25 

^ ST. A method for detecting a target nucleotide sequence indicative of 
a chromosome 14 abnormality in a nucleic acid sample, comprising 
the steps of: 

a) hybridizing said sample with a nucleic acid probe of not 
30 more than 10 kilobases, comprising in the range of 15- 

2000 nucleotides complementary to said nucleotide 
sequence of SEQ. ID. NO: 43; and 




78 



CRO01.NP003 

b) detecting or measuring an amount of any resulting 
hybridization between said probe and said target sequence 
within said sample. 

$q 90. The method of claim 89, wherein said chromosome 14 
abnormality is in a Tng2 locus and comprises a t(14:14)(qll:q32) 
translocation or an inv (14)(qll:q32) inversion. 

^$91. A method for detecting a Tng2 protein in a patient sample, 
preferably a human sample, comprising: 

a contacting said patient sample with an anti- Tng2 antibody 
under conditions such that immunospecific binding occurs; 
and 

c) detecting or measuring an amount of any immunospecific 
binding by said antibody. 

«=?/ 92. A diagnostic kit, comprising in one or more containers, a pair of 
primers, each having at least 15-25 nucleotides, in which at least one 
of said primers is hybridizable to SEQ. ID. NO: 43 or it complement 
and wherein said primers are capable of priming DNA synthesis in a 
nucleic acid amplification reaction. 

*f Zj 93. A method for treating a disease state associated with a 
chromosome 14 abnormality in a mammal, preferably a human, 
suffering from said disease state associated with said chromosome 14 
abnormality, comprising administering a therapeutically effective 
amount of a Tng2 antisense molecule or an anti- Tng2 antibody to 
said mammal. 

°l? 94. The method of claim 93, wherein said disease state comprises a 
T-cell leukemia or lymphoma and said chromosome 14 abnormality 
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comprises a t(14:14)(qll:q32) translocation or an inv (14)(qll:q32) 
inversion. 
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ABSTRACT 

The TCL1 gene family, located on the human chromosome at the 14q32.1 
locus, are implicated in the development of T-cell malignancies. The present 

5 invention discloses the identification and characterization of new members of this 
gene family, the TCL-lb, TNG1 and TNG2 genes. The TCL-lb, TNG1 and TNG2 
gene sequences are expressed at very low levels in normal bone marrow and 
peripheral lymphocytes, but are activated in T-cell leukemia and lymphoma by 
rearrangements of the 14q32.1 locus. The present invention relates to the 

10 identification of these chromosome 14 abnormalities, and methods for detecting and 
treating any T-cell malignancies that develop, as well as preventing the development 
of these T-cell malignancies. 
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SEQUENCE LISTING 



<110> Croce, Carlo 

<120> TCL-lb Gene and Protein and Related Methods and 
Compositions 

<130> CRO01.NP003 

<140> 
<141> 

<150> PROVISION 60/124,714 
<151> 1999-03-15 

<160> 63 

<170> Patentln Ver . 2.1 

<210> 1 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PCR primer 
<400> 1 

ggcagctcta ccccgggatg aa 

<210> 2 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 2 

acagacctga gtgggacagg a 

<210> 3 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 3 

tcctccttgg caggagtggt a 

<210> 4 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 4 

cagttacggg tgctcttgcg t 

<210> 5 
<21i> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 5 

atggcctccg aagcttctgt g 

<210> 6 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 6 

tggtcgtgcg gttcaatccc t 

<210> 7 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence :TC5 
<400> 7 

aatctggcca tggtctgcta tttc 

<210> 3 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : RT-PCR primer 
for TNG1 

<400> 8 

tgcatccctc cagccaagga t 



<210> 9 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : RT-PCR primer 
for TNG1 

<400> 9 

tggcctgcag aggctctcaa g 

<210> 10 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : RT-PCR primer 
for TNG2 

<400> 10 

gtgcctgtct cattcgcctc tg 



<210> 11 
<211> 23 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : RT-PCR primer 
for TNG2 

<400> 11 

agtgggcaca tgttacagca ttc 

<210> 12 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : RT-PCR primer 
<400> 12 

gcatccagga ctgtgccagc a 

<210> 13 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : RT-PCR primer 
<400> 13 

ttctgttagc cttgctgtcc gt 

<210> 14 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : 3 1 RACE primer 
for TNG1 



<400> 14 

ttgaacccag gtctcgtctg ac 



<210> 15 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : 5 f RACE primer 
for TNG1 

<400> 15 

aacgtaggat gtgcacagag ca 

<210> 16 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :murine Tcllb 
primer 

<400> 16 

gagaacggtc aggacccaaa cc 

<210> 17 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : murine Tcllb 
primer 

<400> 17 

caggctatca agacctttac tc 

<210> 18 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : murine Tcllb 
primer 



<400> 18 

tcaacctcgc atattactat gtc 



<210> 19 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : murine Tcllb 
primer 

<400> 19 

caaaggcaca aagtgagcaa gag 

<210> 20 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : murine Tcllb 
primer 

<400> 20 

aatgttggaa acttctcact cat 

<210> 21 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : murine Tcllb 
primer 

<400> 21 

actggaaact tgttctcatt cac 



<210> 22 
<211> 23 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence -.murine Tcllb 
primer 

<400> 22 

cacttgcagc atatgaccac aat 



<210> 23 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : murine Tcllb 
primer 

<400> 23 

cctggtctgc acaagagatg a 



<210> 24 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : murine Tcllb 
primer 

<400> 24 

ctgtccactt gtggaagtta at 

<210> 25 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : murine Tcllb 
primer 

<400> 25 

cacttgtggc agatgaccag ata 
<210> 26 
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<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : murine Tcllb 
primer 

<400> 26 

ccaggagcct actccccagc ag 

<210> 27 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : murine Tcllb 
primer 

<400> 27 

gtggcagatg accacactct t 



<210> 28 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : muring Tcllb3 
primer 

<400> 28 

cattactatg gctgattcag ttc 

<210> 29 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : murine Tcllb3 
primer 

<400> 29 



8 



ggaatgagac tctcagggca c 



<210> 30 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : murien Tell 
primer 

<400> 30 

cctgggcaag gcagacagga gc 

<210> 31 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : murine Tell 
primer 

<400> 31 

tgcttcttgc tettategga tg 

<210> 32 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : murine Tell 
primer 

<400> 32 

ttcatcgttg gactccgagt c 



<210> 33 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<220> 



<223> Description of Artificial 
primer 

<400> 33 

aattccaggt gatcttgcgc c 

<210> 34 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
primer 

<400> 34 

gtaccaccag acagcactgt g 

<210> 35 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
primer 

<400> 35 

gacccagatc atgtttgaga cc 

<210> 36 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
primer 

<400> 36 

aagccatcta taaggtcagg 

<210> 37 
<211> 24 



Sequence :murien Tell 



21 



Sequence : actin RT-PCR 



21 



Sequence : actin RT-PCR 



22 



Sequence rmurine RACE 
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<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : PFGE primer for 
mouse 

<400> 37 

tgctaggacc agctgctcca taga 24 



<210> 38 

<211> 1152 

<212> DNA 

<213> Homo sapiens 

<400> 38 

gaggcgggtc ccggttgcag acttgccatg gcctccgaag cttctgtgcg tctaggggtg 60 
ccccctggcc gtctgtggat ccagaggcct ggcatctacg aagatgagga ggggagaacc 120 
tgggtgactg tggtcgtgcg gttcaatccc tcgcgtaggg aatgggccag ggcctcccag 180 
ggcagcagat atgaacccag catcacagtg cacttgtggc agatggcagt gcatacccgg 24 0 
gagctactct cctccggcca gatgcccttc tcccagctgc ccgccgtgtg gcagctctac 300 
cccgggagga agtaccgagc agcggattcc agtttctggg aaatagcaga ccatggccag 360 
attgactcta tggagcagct ggtcctaaca tatcagccgg agaggaaaga ctgacactgg 420 
gagtggctgg ccctgctggc cctgcctctt ctggcctggt gtctcctcat gccccctcag 480 
tgaggatctt catgtacctg ctcttctgtt tgcacaccca gcatagcctc cttgcaggca 540 
gaaggcagta gggcccctgc acactcagtt tctctcgttt tccttagtta tcagtcctgt 600 
cctgtcccac tcaggtctgt acttagggca gctggcctgg atgggcttca ctggggccct 660 
gtctgtgtgc tgagccagtt tcccctgctg gctgcaagct gtgggttctt tctcctctgt 720 
gcccctcatg ctgatcttct agatgccact cccaaatccc cttcataccc accaggatgt 780 
gtgcccagcc aggcctccag cacccccagt gcagctcgtg attggaaact caccatcggc 840 
aggcagtggt tcggtttaag agatggcatt agagggagcc cagtctggat gtggacttgg 900 
atgccctgtg ggtatcagtt ctgctgacac tttggcccga aatagatcca gtgctgagca 960 
agcaatgtac accggagcct cagtgagccc atctgcacag tggggagcat ggagggatgg 1020 
gtttggcctg tgcttctgct tattcagtcc ttcagctcac ggaagggatg ctagtccgtg 1080 
aaggtgacct cacagtactg gttaattaaa ctttattgct cactgtcaaa aaaaaaaaaa 1140 
aaaaaaaaaa aa 1152 



<210> 39 
<211> 128 
<212> PRT 

<213> Homo sapiens 
<400> 39 

Met Ala Ser Glu Ala Ser Val Arg Leu Gly Val Pro Pro Gly Arg Leu 
15 10 15 
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Trp lie Gin Arg 

20 

Val Thr Val Val 
35 

Ala Ser Gin Gly 
50 

Gin Met Ala Val 
65 

Phe Ser Gin Leu 



Arg Ala Ala Asp 

100 

Asp Ser Met Glu 
115 



Pro Gly lie Tyr 



Val Arg Phe Asn 

40 

Ser Arg Tyr Glu 
55 

His Thr Arg Glu 
70 

Pro Ala Val Trp 
85 

Ser Ser Phe Trp 



Gin Leu Val Leu 

120 



Glu Asp Glu Glu 
25 

Pro Ser Arg Arg 



Pro Ser He Thr 

60 

Leu Leu Ser Ser 
75 

Gin Leu Tyr Pro 
90 

Glu He Ala Asp 
105 

Thr Tyr Gin Pro 



Gly Arg Thr Trp 
30 

Glu Trp Ala Arg 
45 

Val His Leu Trp 



Gly Gin Met Pro 

80 

Gly Arg Lys Tyr 
95 

His Gly Gin He 
110 

Glu Arg Lys Asp 



<210> 40 

<211> 6486 

<212> DNA 

<213> Homo sapiens 

<400> 40 

tcctcctcct ccccctcctc ccccgactgg caccgccccc actgccggcc ccgcccccac 60 
tgccggcccg ggccccaccc acgccggagc tgctccattt aaggagattg cgcagctgga 120 
aagctacacg tgtgagccta gaggcgggtc ccggttgcag acttgccatg gcctccgaag 180 
cttctgtgcg tctaggggtg ccccctggcc gtctgtggat ccagaggcct ggcatctacg 240 
aagatgagga ggggagaacc tgggtgactg tggtcgtgcg gttcaatccc tcgcgtaggg 300 
aatgggccag ggcctcccag ggcagcagag tgagtcctgg gcacgagggg aggctgtggg 360 
gagggctgcg cactgacccc tgcccgtgtg ggaccgcggt gggggtcaga gggggccgtt 420 
ctcacccgca ctggaaaact cacttctgtg caggtctagg agcgcagcaa tgtccatgcc 480 
cagccctggc cccaggaaca ccccccgtaa agggaccaca ggcacaagct tatccacatg 540 
agataatgtg gtcctgcgtg gtgaagccga ggctaaggta gctcagggct tagtgccatt 600 
cccagtgcct gctgggaagg cccacaaatg gggcagctat tgagctgggc tttgtgggat 660 
gagtaggagt tctccaggtc tagaaaggag gcaggagtag tataagcaaa agcattgcag 720 
cctggaggca ccaggtgggc caacaggatg aacatgacat tggtgtcaga ttactgatct 780 
gcaaaatgag aataatatac ctctgtggca agctagtcac agacatgctc acatacatgg 840 
ctcaccgcct gaatggcctg gggaagcatt tgactgataa cagattctgg aaattaattc 900 
aggaggcttg ggtggagtcc tagattcttt acttttcaaa agctccccag gtgataatga 960 
taatgactca ggaaacggct gtagatgagg gctttagatc acagccagtc tttgagggat 1020 



12 



gaagtaaata cagtagcgtc tctggtgtgg gtggcggtgg ggaattgatt ccaggaccga 1080 
ctgtggatgc tcaagtccct gatagaaaat gacctgggta gtaattacat ataacctcag 1140 
cgcatcctct actatatttg aaatcagatt actaataaca cctaatgcta cacctacaca 1200 
tcacttcaag ctctgctttt gggaactttg tggaatttct ttttttcccc aaatattttt 1260 
aatctgaggt tagtcgaatt catgggtgca gtatccatgg aaatgggggg ctggctgtac 1320 
cttagtgtaa tgtggtaaaa gcatatccgg atatttaaaa tgccatttag ggctgggcgc 1380 
ggtggctcac gcctgtaatc ccagcacttt gggaggccga gatgggctta tcacgagatc 1440 
aggagatcga gaccatccta gccaacatgg ggaaaccccg tctctactaa aaatacaaaa 1500 
aattagccgg gcgtggtggc ggacgcctgt agtcccagct actcgggagg ctgaggcagg 1560 
agaatggcgt gaacccggga ggcgaagctt gtagtgagcc gagatcgcac cactgcactc 1620 
cagcctgggt gacagagtga gactccgtcc caaaaaaaaa aaaaaaaaaa aatgccgttt 1680 
aggtcttcgt aaacaattca ctgcctgttt gtttgttttt tgagaaagtc ttgctctgtt 1740 
gcggctggag tgcactggtg tgatgttggc tcactgcaac ctccacctcc caggctcaag 1800 
tgattctcat gcctcagcct cccgagtagc ttggattaca ggcgattttt ttttacagtt 1860 
aatttttttt gttattttca ggagagacaa aagtttaatc atgtgggcca ggctggtttt 1920 
gaactcctga cctcaagtga tctgcccacc ttggcctccc aaagtgctgg gattacaggt 1980 
gagccacctc gcccagccag ttcactgaca ctttaaacaa tataacacat ttcctaaaaa 2040 
aagttcaaat aggttatttc aaaaaatgtt ggtagagaac atggaaaggc ttttctgtac 2100 
atacactaaa taaagcatgc aaaaattgtg gagcaaatat tttaagtttt tcaaaagcct 2160 
gaaaaagtgt taatggaggg cactgtaaaa tggtgcagcc actatggaaa acaggatgag 2220 
gatttctcaa aaaaagaatt acggcataat ccagcaatgc cacttctgga tatataccca 2280 
caagactctg aagccggaac ttaagcatgt attcatacat ccatgttcac agcagtatca 2340 
ttcatactag ccaaaaggtg gtggcagccc ccgtgtccat tgatagatga atgggtaaac 2400 
aacacaaacc atgaagtatt cacccttaaa agtcagacac acggatgaaa cttggagcca 24 60 
ttatactaaa tgaaatatgc cagtcacgga aggacagatt ctcttgtatg aggtactcag 2520 
agtggtctca ttcataaagt ggaatggtag ctgccagggg ctggagggag tcgaggatgg 2580 
gaagttaatg ttagtaacag gtacggagtc tcagtttggg aagataaaaa gttctggagg 2640 
tggatagtgc cgacggttcc acatgtcaat gcacttaatg ccaccaaact gtactcttaa 2700 
aaacagttga ccgggcacgg tggctcacgc ctgaatccca gcactttggg ggaccgaggc 2760 
gggcggatca caaggtcagg agatcgagac catcctggct aacacggtga aaccccgtct 2820 
ctactaaaaa tacaaaagaa ttagccgggt gcggtggcgg gcgtctgtag tcccagctac 2880 
tcggggggct gaggcaggag aatggcttga acctgggagg cggagcttgc agtgagctga 2940 
gatccagcca ctgcactcca gcctgggcga cagagcaaga ctccgtctca aacaaaacaa 3000 
agcaaaacaa aaaaaacagt taagattttt ttttttttta aatgattcag tggaaataga 3060 
atggattctt caaataactt agccacgggt gggataaggg acctacttag taagtatttt 3120 
ttccccttct ttcttaaaaa tagatcgatg tcttagggtg ggaattaggc ttcctgggcg 3180 
acacatctaa tgcaaagatc agccaccttt ttctgtaaag gatctgatgg taaacatttt 3240 
ccacttgaga gctatgctct tgcagctact cagctctgct attgcagtgc aaaagcagct 3300 
aaaggcaacg gtaaaggaat gacggaagga gccttagttt atttacaata aagctttatt 3360 
tgcaaaagca gatgcaagcc agacttagtt tgctgatctc tgatctacag tcagaataca 3420 
cagagaagga gagattttgc cgtataattt aaaatacttc tctttgcaaa agcagtccat 3480 
aaaaaaagtg aggacaacaa actgagaaaa attattcaca acatgtctga ttgatagagc 3540 
actaatattc ttaattcaaa aagacatttt atcacaaaag aagacaaata cttagaaaat 3600 
tgtgcaaaag actttccatt ttgttgcata acgtaggaag ctttggtttt acttttccta 3660 
tcatctttct aacttccagt accagcctaa ttttgttatt tttattatta tgtatttatt 3720 
ttgagacaga gtcttgctct gtctcccagg ctggagtgca gtgacctgac gatagcttac 3780 
aacagcctct acctcccagg ttcaagaaat cttctcacct tagcttcccg agtagctggg 3840 
actgtaggca catgccacca tggccagcta attttttatt ttttgtagag acagagtctc 3900 



attatgttgc cgaggctggt cttgaactcg 
cccaaagtgt tgggattaca ggcataagcc 
tataagtgtg tgaaggtcat gatcagaact 
cctcagatcc aggagtccat ggatatcttg 
caatggcaga gatggagccc caagagaata 
ggaaacagtg gctaaatttg gggattgcag 
gccagaaggg ccccagccat cctagatgga 
ctggaattct gaaaacattt acctctgacc 
agcccagttg gcagggaacc ctatccatga 
tgaacccagc atcacagtgc acttgtggca 
ctccggccag atgcccttct cccagctgcc 
gtaccgagca gcggattcca gtttctggga 
gtggttctag gtgaaagcga caggtggccc 
tgcccctggc ccccttgggg ttcttgtctg 
aggaggcctg agtgtgtgtg ggtggatcgg 
agtgggtgag ggagggaggg ttgccttccc 
ccacctgcct cacccctgcc tgctgctgct 
actggaagag catctcccag aggttctgat 
tggagcagct ggtcctaaca tatcagccgg 
tatgttgggg ccctgtgcgt ctcggtgtag 
cttttcagaa agacggcgtg gcctcctcct 
agccacctgt cacctctgtt ccccagcccc 
cccataataa gttcctaaag catgggatct 
tgagaatttt gtgaggtgta cgtgttaatg 
ggggatggga atgacttcct cgagaccata 
ctcgggtccc tagatttaac catggcactg 
ccagcactga cccatagagg gctcctctca 
gggcctggtc cattggaggg gctggcactg 
gggtgccggg ctcccccata gttcactgac 
tggccctgcc tcttctggcc tggtgtctcc 
cctgctcttc tgtttgcaca cccagcatag 
ctgcacactc agtttctctc gttttcctta 
ctgtacttag ggcagctggc ctggatgggc 
agtttcccct gctggctgca agctgtgggt 
ttctagatgc cactcccaaa tccccttcat 
ccagcacccc cagtgcagct cgtgattgga 
taagagatgg cattagaggg agcccagtct 
agttctgctg acactttggc ccgaaataga 
gcctcagtga gcccatctgc acagtgggga 
tgcttattca gtccttcagc tcacggaagg 
actggttaat taaactttat tgctcactgt 
tgacctcttt ctagcataga aatggcagct 
aatgatccat ttcatatttt tccacactgg 
caggac 



tggcttcnag cagtcctcct gccttggcct 3960 
actgctccca gccttatttc gtatatttac 4020 
gccatatatt ttggcgggaa aatctatcac 4080 
tttttaaaac gaagatttaa aaaattacgg 4140 
ctcagcttta acccaaggtg ttgacaggtt 4200 
tggggcgagg cagggtgcag gtcagagggg 4260 
gccacaagta ccagtgccaa ggctcttggt 4320 
ctggcagccc actggccatt gcttgtgtgc 4380 
tttgccgcct cttttctggt cccttcagta 4440 
gatggcagtg catacccggg agctactctc 4500 
cgccgtgtgg cagctctacc ccgggaggaa 4560 
aatagcagac catggccagg caagtgtgtg 4 620 
ctggtgactg ccgtggccct ctctcttctg 4 680 
tcctcttcct gttgctcaag tcttccttca 4740 
tgcatgagtt cccatgtggg atgcaggcag 4800 
tgggctaggg aaatccataa gctggagttc 4860 
gccagcctgc atgggcggcc gttaaggcca 4 920 
ggctgctccc tctcctgcag attgactcta 4980 
agaggaaaga ctgacactgg gagtggctgg 5040 
ggatcagacg aaagtgagaa gacctctcct 5100 
ccctgctgtt tgctgagatt tttcttacat 5160 
ttggatgtga tggtacacag tgggtgggcc 5220 
catcgaataa gactcatcat ttaatccttg 5280 
tcccatttca cgacgaaaag acaagactct 5340 
cagccaggaa atagcggtga atctagtgat 5400 
aggtgccgtg tgacggtggc cttggaggac 54 60 
gatgggcagc agcttggagc aggccaggca 5520 
gacttgcctt tgaccccagc agcttggatg 5580 
tgtctccttt ggtcttctcg caggccctgc 5640 
tcatgccccc tcagtgagga tcttcatgta 5700 
cctccttgca ggcagaaggc agtagggccc 57 60 
gttatcagtc ctgtcctgtc ccactcaggt 5820 
ttcactgggg ccctgtctgt gtgctgagcc 5880 
tctttctcct ctgtgcccct catgctgatc 5940 
acccaccagg atgtgtgccc agccaggcct 6000 
aactcaccat cggcaggcag tggttcggtt 6060 
ggatgtggac ttggatgccc tgtgggtatc 6120 
tccagtgctg agcaagcaat gtacaccgga 6180 
gcatggaggg atgggtttgg cctgtgcttc 6240 
gatgctagtc cgtgaaggtg acctcacagt 6300 
ccacttttgt gctgaattgg agcctctctt 6360 
tctggtaccg aaatgttaag gtaacatttt 6420 
gaaggaaatt gtgattggtc cattcagcag 6480 

6486 



<210> 41 
<211> 1455 
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<212> DNA 

<213> Homo sapiens 



<400> 41 

gcttgctctg gtgctagggt cagctagaca ggtcggcagg agaccagctg ctctagaaga 60 
gcctggacag gatgaattgg ttctccttcc acgtgcatct tgcatccctc cagccaagga 120 
tttgaaccca ggtctcgtct gactccaaat gcaccaagag gatggagccc agagtgacac 180 
agaggaagag gcccctggat ggatgcatgg ggaagatcac gggaattaca agtgacatcc 240 
tgaagtatga tcacaaatgt ttcaaattaa gccttcctgc taagtttcca gaggtgtgtg 300 
gctcggacga agtgtttcca gaccctgatc tactgcatgt cctgcctgtt gctggttctt 360 
tgcagcagtc cattgaccaa tgctgtctac agcttgagag cctctgcagg ccagggctgc 420 
tctgtgcaca tcctacgtta ctattcaaac tgcacagcag catgaaaaac aggcccttct 480 
tttctctcat ttacacatat gtgaagaaga cacagcaagt gagaaaacgt gaccgaaagc 540 
cacgcggaca ggtggccgca gggccaaacc caacttccgt gatgtgatct ggctacagtt 600 
tggagcccac atatttcaat agttatcccc aggcctggca cgtcaaagag gctccagaaa 660 
tgtttgctga gtgagtgatg aatcagcaat ttctgatttt ctcctaaact tagagaataa 720 
gtggcttgtc tgcagtcacc cagggcgtcc ctgacagagg tacaagtaaa gcccagatct 780 
ctgcatctct ggttcaaggc cttttgtact acacaccgct gtctctttgt catcattaaa 840 
agcaaggctc aagggacatg ttttctagcc ttagttcccc atctacaaaa tgggcctcat 900 
ggaatggaat gtctccactt cactccagca tcaacaagtg gggaattctg atggattcaa 960 
ttcgacttct ttccatgggc gtgttctaag cagcctcttt gttccagaag ctgccctcag 1020 
ccagagttgg ataagccaat cctcactccc cagcctcctc tggataggga tgaagacccc 1080 
actggggttg gaagtgcaga ggcagacagg tgtatggagt cacctgtaaa ttgattcaag 1140 
tgagccagga aagcagcaaa ggaaagagaa acctgagtga cgacgtggtg gaggaacagg 1200 
gctggaaaga ggctgctggc tgtctggctt cgcagctctg gcctcctaat cagcctcgct 1260 
cttgtctctg gtgttctctg gctcttgtcc atctgtctgt gtttcttttt gccagctatt 1320 
gactaatctt tgctgaagct gagctagaat tctggtgttt ataagcaggt aactagctga 1380 
gcactagttg ataactttgg agttagtgga taaccacagg gctgggctac taagcttttt 1440 
cagttgaaaa ataaa 1455 



<210> 42 
<211> 140 
<212> PRT 

<213> Homo sapiens 
<400> 42 

Met Glu Pro Arg Val Thr Gin Arg Lys Arg Pro Leu Asp Gly Cys Met 
15 10 15 

Gly Lys lie Thr Gly lie Thr Ser Asp lie Leu Lys Tyr Asp His Lys 

20 25 30 

Cys Phe Lys Leu Ser Leu Pro Ala Lys Phe Pro Glu Val Cys Gly Ser 
35 40 45 

Asp Glu Val Phe Pro Asp Pro Asp Leu Leu His Val Leu Pro Val Ala 
50 55 60 
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Gly Ser Leu Gin 
65 

Leu Cys Arg Pro 



Leu His Ser Ser 

100 

Tyr Val Lys Lys 
115 

Gly Gin Val Ala 
130 



Gin Ser lie Asp 
70 

Gly Leu Leu Cys 
85 

Met Lys Asn Arg 



Thr Gin Gin Val 

120 

Ala Gly Pro Asn 
135 



Gin Cys Cys Leu 
75 

Ala His Pro Thr 
90 

Pro Phe Phe Ser 
105 

Arg Lys Arg Asp 



Pro Thr Ser Val 

140 



Gin Leu Glu Ser 

80 

Leu Leu Phe Lys 
95 

Leu lie Tyr Thr 
110 

Arg Lys Pro Arg 
125 



<210> 43 

<211> 2078 

<212> DNA 

<213> Homo sapiens 

<400> 43 

ctcccaaagt gctgggatta caggcgtgag ccaccatgcc cggtcacctt tgtcctgttt 60 
tctacaggtt agaagcaagg cacaagtcat acctacattc aaggagaggg gaccatgcaa 120 
aggataaact gcagagagac aacagatttg catcccagac acattcacac tgggccaaga 180 
gcagctgcca ttgcagattc gagtcacgtc cttttttcct tccttctcct tcttggagtt 240 
acaacagaag ctggggaggt gagagtgcag aaaggacgtg gatgaagcag agaggcacgt 300 
gcctgtctca ttcgcctctg gatctgctgc atccaggact gtgccagcac aaagtgggag 360 
cccgataaat atttattgaa gaaatgaaag gtgacaaaag gcagaaggag ggagacccca 420 
aaaagagacc ccaggctcca gaagcagatg aggagacaaa cagaagagca gaagaatggc 480 
agaatggcac agcagagaga agaaaaggaa catctgaatg ttgagaggaa tttggctggg 540 
ggcggttgga gaggagatta gccactggac agccaaactc caggggaaga tcatcttccc 600 
actccatccc ccttccagct ccccatccat cctgctgaga gccacctcca ccactcagtg 660 
aaaccactgc attcatcctt caagtttatg tgtgacctga ttcttcctgg acattggaca 720 
aagacctagg tccagtgaac tgtctaacac ttaagtcatc cacggacagc aaggctaaca 780 
gaatgctgta acatgtgccc acttggactc tgggagttgc agacacccac ccgtaggtgc 840 
tgccatgggt ccggagccca gaaagtgctt gccctggctc ctgcacctgc ccgtctgcat 900 
gctccccctc cggtaagggg tttgaggatg tggcggctga acataagagc tatgcccctg 960 
tcacacatca tgggacaggg gtcagggaac tctcccattt cactttcaaa tattccaggt 1020 
cccctttttg ttcgggagcc agagtctagt ggagacagag atagaaaccc tgaaatagga 1080 
tccaggcaat ataaacctac acgctgtaag ccaagtcaga gtgaggctga tgtgataaca 1140 
aaagtctgaa tacaaggtgg gagacaatta ttctgactga aagggaggca agagttctgc 1200 
cataggcttg atgctgtgtg tccctggctt cctaactcat ttgcaaatgg aaaatactat 1260 
tccattgtag gaaccttctt tttccatttg tctggggaag gagaaagaat ggctgggctg 1320 
aatgaattta tcccttggtc ctcttccaac aaagctcagc tatgaaagat aaatccagct 1380 
ctccccaccc cctctcagtg gagctgggga gaaatcaaaa gcccctctgc caataatgag 1440 
accaaagttt gcaagggcag gacgagcccg tgctaacaga gaaagtgttg tttcctcaat 1500 
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ttggttttag actgtcttgt cctatggggg agaaaagatc tgcccttggg agaggtgcca 1560 
actttataga tctattaata aaagaactgg caggcttaca gttcttgcca atgaggaaac 1620 
ttgaatgaga gaagccaggc tcaaccttgg ccaacagact ggagcccatc accctaactt 1680 
caccccgctt ctccttaccc aaccatcaaa ggctaggcag cacccaccca gcagcttcca 1740 
cctggctgaa gcctgcacct gcttcagacc aagggttaga tggaaatttg gcatgggaag 1800 
agagggctca cctgtgggca ggatagactc tatccaagaa ggagaactga aaaatgaaaa 1860 
cctatgagac aaggggtgat cctgaaggca ggcaggagaa agggctggag ggagaggcac 1920 
tggggaattt ttcctggtga atactgaagt tactagatgt tttgtcttgc aaaactcaag 1980 
ggaaaactct caaactctaa tgtttgtcta ttctgtgtcc aaactgtcct tttgaaacgg 2040 
acccaccttt cagtaaagaa acttgcattg gcctgccc 2078 



<210> 44 
<211> 110 
<212> PRT 

<213> Homo sapiens 
<400> 44 

Met Pro Gly His Leu Cys Pro Val Phe Tyr Arg Leu Glu Ala Arg His 
15 10 15 

Lys Ser Tyr Leu His Ser Arg Arg Gly Asp His Ala Lys Asp Lys Leu 

20 25 30 

Gin Arg Asp Asn Arg Phe Ala Ser Gin Thr His Ser His Trp Ala Lys 
35 40 45 

Ser Ser Cys His Cys Arg Phe Glu Ser Arg Pro Phe Phe Leu Pro Ser 
50 55 60 

Pro Ser Trp Ser Tyr Asn Arg Ser Trp Gly Gly Glu Ser Ala Glu Arg 
65 70 75 80 

Thr Trp Met Lys Gin Arg Gly Thr Cys Leu Ser His Ser Pro Leu Asp 

85 90 95 

Leu Leu His Pro Gly Leu Cys Gin His Lys Val Gly Ala Arg 

100 105 110 



<210> 45 

<211> 4509 

<212> DNA 

<213> Homo sapiens 

<400> 45 

gagcttgctc tggtgctagg gtcagctaga caggtcggca ggagaccagc tgctctagaa 60 
gagcctggac aggatgaatt ggttctcctt ccacgtgcat cttgcatccc tccagccaag 120 
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gatttgaacc caggtctcgt ctgactccaa atgcaccaag aggatggagc ccagagtgac 180 

acagaggaag aggcccctgg atggatgcat ggggaaggtg aggcctggag gtgatgtgct 240 

tgagattacc aggctcttct gaacctgggg gaccttctcc tatgatagcc ctgctcctcc 300 

agacccagca ggtgatttcc ccaaaagccg cttacagtga acgcacctga tagcaataac 360 

ttaagcatcc cctgagaatg acccttatgg caggtgcagg tgaatgtggt ttggagttct 420 

gagctaagga atctgggagt ggccaacctg gagattcatt cctattcctt atctatgagg 480 

aacatctgag ctcctgtccc gtcctgtgca acacggacag tacaggggat caaggccctt 540 

tgttttgggt taggtaaatg ttgccaggtg gacactgttg ggggagggtg ctaagtgaag 600 

atgctatata tgctgcaagc tttttgtaag ttgtgtggct ctcctgccca gcccaccact 660 

gctagatgct ctcccctgca tgtaagcccc agtaaaactc catgtctcct tcagcagccc 720 

tgggtccttt cttcagcctc tcaaacctgc tgccatcccc attggagtag ataggggttc 780 

agcataacag cagccaccag ctggggaaag gcgtggcatt tggagctccc tacatggtct 840 

tgactctaat atggagggaa agaaaacaga ttctgccctg ctcagtaaag ccccctactg 900 

gcctcccacc ctggactgtg gacaaagctt tttttttttt ttgagacgga gtctcactgt 960 

gtcccccagg ctggagtgca gtggcgtgat ctcagctcac tgcaagctcc gcctccccgg 1020 

ttcacgccat tctcctgcct caggctcccg agtagctggg gctacaggcg cctgccacta 1080 

cacccggcta attttttttt ttttttttca tttttagtgg agacggggtt tcatcgtgtt 1140 

agccaggatg gtctcgatct cctacctcgt gatctgccca cctcggcctc ccaaagtgct 1200 

gggattacag gcgtgagcca ctgcgcccgg ccaacaaagc atattttttt cctctgaggt 1260 

ctttacccta taactgcaag tggtttcatg acactgcaca tgggttctta tctggggttg 1320 

gtagattcaa ctcaattcca caggtttact gggtgttgcc tcccatgctt ggagccagga 1380 

ggatgcaggg aaatggggct gccaccatac ccgctcttct tatgatccca tcacatcatt 1440 

gcaccctttg gggcctggca ccatgctctg gcactttgcc tccatcatct cctttaacct 1500 

ctgagctctg tcaggaaggc attactacta tccctggaat acagacagga aaactgagtt 1560 

ggattgaggc tcagagaggt taagtcactt acccaaggtc acaaagctag agagtggcag 1620 

gccagcatcc aaggccagga acacgagctc cctctgatcc aaatggtgac tctatgtttg 1680 

atcagtgctc tgagaactct ccaaagatct ttatcaactg tcatctcttc attgcccagg 1740 

cccagtcagg gagccacttg gctgcggtcc cgctgagatt taggggcagg acccctgatg 1800 

tgaggtggaa tgcttggtca cgctgttcaa gcctccacca gtcactcctg actggttatg 1860 

gacactcaat gtgccccgtc ctgtgtgcaa tgcagtgtgg gaaggggatc aaggaccttg 1920 

gctaatccca ggcatttccc cagaatgtcc caggtgtgga ccccttggga ggtgctggga 1980 

agcaggctgg tgtcccacag gccttggttc catcccagct gtgctgtgct aagctgaaac 2040 

tctttgggca gcttgtttca cctcatgaga cttctttttc tgaactgtta aatgggatga 2100 

ttcgacctct gcttggcttt tggtgacgat taaacgagat attagctgta aagcatttca 2160 

tcccgggtct ggtatagaag aagcacttga taaatgaaag ctctgttgag tatgatgatc 2220 

cgtcacccca ggcatcagtg ggatctcaca gcaggaacaa tgcctaactc ctaactgatg 2280 

cccagtaaac atcgggtgga tgcaatgtcc aacaagacag atattcatgt tcagttttac 2340 

agaaaaggag acagaggcca gaaaaggaag aaagtgccca ggtcagccgg ctagtaagag 2400 

gcagagttga aatttgaacc gggttctggc tggctgcagc tccactgctc cactactgca 24 60 

ctgtctgaga gccaaaaggc accttgtagt caggcaggca gggagctggc ttcttgctgt 2520 

cctgctggat cctgccctag tgtggccctc cctggaacca ggggtggggc tggagggcag 2580 

ggcactgaga gagatctcct gggaactgta ccatctgcac aaagggcacc tgctgggcac 2640 

tctgctgaaa gctgtctgca aacaccactt cctgcctgcc tccctggggg ctctaagctc 2700 

aggagggggt gagggcctat gtctgcacgt ggtgaggccc ctgctgggta ttgtaccaca 2760 

ccttcattgt gctcagggtg gccagtggag gcccttccta gactcgagct ctggtcacaa 2820 

gcccaacccc ctccttgctc ctgtctgggc tcacctggaa tctcctgcct cactctgtgg 2880 

ccagaggcac ctttggaagc ccaggtcacc cctactactc atgcccctca tttggaaatt 2940 

ttcataatct acctcttgtt aatgattcca tcccatctat aatgcttttc tatttttctg 3000 
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aatttttccc cgataggcta taactgctgc aaagatagag aattgcaatg attaatatgc 3060 
atttatgtat ttccaaaaag ttaccagaat atcaaactca tgattttgtc tatagtttgg 3120 
ttagattcaa gctattttta tatcataaaa caacttaaaa tttttttatt tttaattttt 3180 
gtgggtgcag agtgatgtat ttatggggta cccgagatgt tagaatgaag ctattaaatg 324 0 
tgagtgtgaa cgttcacctt tttgcagatc acgggaatta caagtgacat cctgaagtat 3300 
gatcacaaat gtttcaaatt aagccttcct gctaagtttc cagaggtgtg tggctcggac 3360 
gaagtgtttc cagaccctga tctactgcat gtcctgcctg ttgctggttc tttgcagcag 3420 
tccattgacc aatgctgtct acagcttgag agcctctgca ggccagggct gctctgtgca 3480 
catcctacgt tactattcaa actgcacagc agcatgaaaa acaggccctt cttttctctc 3540 
atttacacat atgtgaagaa gacacagcaa gtgagaaaac gtgaccgaaa gccacgcgga 3600 
caggtggccg cagggccaaa cccaacttcc gtgatgtgat ctggctacag tttggagccc 3660 
acatatttca atagttatcc ccaggcctgg cacgtcaaag aggctccaga aatgtttgct 3720 
gagtgagtga tgaatcagca atttctgatt ttctcctaaa cttagagaat aagtggcttg 3780 
tctgcagtca cccagggcgt ccctgacaga ggtacaagta aagcccagat ctctgcatct 3840 
ctggttcaag gccttttgta ctacacaccg ctgtctcttt gtcatcatta aaagcaaggc 3900 
tcaagggaca tgttttctag ccttagttcc ccatctacaa aatgggcctc atggaatgga 3960 
atgtctccac ttcactccag catcaacaag tggggaattc tgatggattc aattcgactt 4020 
ctttccatgg gcgtgttcta agcagcctct ttgttccaga agctgccctc agccagagtt 4080 
ggataagcca atcctcactc cccagcctcc tctggatagg gatgaagacc ccactggggt 4140 
tggaagtgca gaggcagaca ggtgtatgga gtcacctgta aattgattca agtgagccag 4200 
gaaagcagca aaggaaagag aaacctgagt gacgacgtgg tggaggaaca gggctggaaa 4260 
gaggctgctg gctgtctggc ttcgcagctc tggcctccta atcagcctcg ctcttgtctc 4320 
tggtgttctc tggctcttgt ccatctgtct gtgtttcttt ttgccagcta ttgactaatc 4380 
tttgctgaag ctgagctaga attctggtgt ttataagcag gtaactagct gagcactagt 4440 
tgataacttt ggagttagtg gataaccaca gggctgggct actaagcttt ttcagttgaa 4500 
aaataaata 4509 



<210> 46 

<211> 8621 

<212> DNA 

<213> Homo sapiens 

<400> 46 

cctcccaaag tgctgggatt acaggcgtga gccaccatgc ccggtcacct ttgtcctgtt 60 

ttctacaggt tagaagcaag gcacaagtca tacctacatt caaggagagg ggaccatgca 120 

aaggataaac tgcaggtgaa gaacttgaga gctattttag aagctgcctt ccacacttta 180 

caatctactt tgcagctata ctgagctctt agcaactccc tgggcctgct gagcacctct 240 

tgcctttctg cctttgcgta tgctgttccc tctgcctgga attccatgtc ttctgccttt 300 

gcccacctca tcttaaggcc tctgcttaga tgcaatttct ttgggctctt attagcagca 360 

tccaaatctg tgctggtttc ttctcacgct tactctgcat cataattttt ctgtttactc 420 

gttaatcccc tcccactcct gccagtacct aaagtaaaat ctacacgggc ttggcacatt 480 

gtggatgctc aatgaagatt tgttaaatga gtgaatgatt aaatgaaata agccccagct 540 

tggaacccag agctgtaatt tccggttgtg gtccctgtcc cctgcccagg ggctgtctaa 600 

gggctcatct ctcccaggag cccctgtatc atctcagtaa ctagaccagt accatatccg 660 

acttcccctg actttgaaat aattttcacc tgcaatcagg tgcgcttccc tagaacctct 720 

ctcgctgttc tttgcctttc tccttccccc gccttgtatc tccgtggtcc tgccaaaaat 780 

cagaagctca ttattaaacc agaagcttcc ttgcattatc ccagagacta cagacagaaa 840 
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atatttgctt ctatctcaac tgaggactga catagtaaag actccttgag atgttagctg 900 
tcaaggaggg gaataaaagt ttttcatctg catgagtaat tgtttcctgt tttatccaga 960 
gagacaacag atttacatcc cagacacatt cacactgggc caagagcagc tgccattgca 1020 
gattcgagtc acgtcctttt ttccttcctt ctccttcttg gagttacaac agaagctggg 1080 
gaggtgagag tgcaggtgag agaggagggg ggctctggaa catgtagcct gtgaggaggg 1140 
aaatcaggat ggtgcttggg tggaggtggg agctgggcca gggtcttggc tggtcaggaa 1200 
ggctggtgaa gcatgtactg gggaggcaga ggtcctgctt tctttcctct ctttccttca 1260 
cacctacctc aacccactgt cgagtcctat cagctctgaa atagatcctt cctccagcgc 1320 
cttctcacct ctctctccac cattccagtc catgccgcct catctcttac tgggactttc 1380 
acaacagcct cccgagtggt cttggcttct gctcgcaaat cctgcagtgc cttctctgcg 1440 
gagccacaag ctggatcttt ttaatccgct ctcggtttct cacagccctt ggaatgaaac 1500 
cttggcctac agccctgtgt gacttgctac ctgtccttac ctcctaacct catctaatat 1560 
ccccttcccc tcctcgtacc ggcctttctg tttctcagac actccaggct tgttctgctc 1620 
ttggaccttc acgctgtctt tctgtctcct gggaatctgt gccccctctg ccagatcatg 1680 
cagctggctc tctctgccca aataccctct tccgaggggt cttccttggc cctgtgtctg 1740 
acataggtac cctcctccat cactccagcc cattacagtg ccataacact tttattgcca 1800 
tctagagtaa catcagttgg ttttgtttat catctgccac cctgctagaa aggacgtgga 18 60 
tgaagcagag aggcacgtgc ctgtctcatt cgcctctgga tctgctgcat ccaggactgt 1920 
gccagcacaa agtgggagcc cgataaatat ttattgaaga aatgaaaggt gacaaaaggc 1980 
agaaggaggg agaccccaaa aagaggtaaa tttggtatct tagcaagtgg gtgactacca 2040 
ctgtctgggg gaaagctgat acacgaggac ttgacaataa ttacagcagt aacaatcgct 2100 
actattttgt tttgaaatga attttagttt ctgaaattgt ccaaagcact tttatcaata 2160 
gaaacttgtt acacatccct ccaacaggga gaagcagcac aggaagctgc cccctcaaac 2220 
actaaaggct cctggagcta ttcacagtgc tggggaacaa ccagaggagg aggaggacag 2280 
agggagagaa tacttttttt aaaaatccca aacaaaacca aattctatac atgtatatct 2340 
cgagtgagta aatgtgtaac actcagcagt ctcctacact tgccaggttt tgtggcttca 2400 
tgcagtaata taaaatgtgc ttcacaaagc taggagttgg cacagaatat agggttctga 24 60 
ttcatatttt agttttactt ttttgtagag atgggatccc actatattgc acaggctggt 2520 
ctcaaactcc tgagctcaag cgatcctcct gccttggcct cccacggtgc tgagactaca 2580 
ggtgtgagcc accactccca gcctgattta tattttataa agaggcaacg atggagtaca 2640 
cagctgagca atctcttgca tttctggacc ttgagcatca ggaagcttag ggagggtgaa 2700 
aactcagaag aaagctcaga gttgagacct ttgatttttt gcccagagag tccagctaac 27 60 
acactttata acccaaaagg ccttacaaat atttcagagc agaggcagca tttttgcact 2820 
ggataaacaa aggtacggta atttcttcat agtttatttt cattacaagt aattcttttc 2880 
ctgaagaaaa atttatattt cagatttatt tcccctctca aaggggaggg gagttataat 2940 
attctgtttc catagcacca ttcctcaaga tcaaacccag gcaggcattt ttacagcaca 3000 
cacttccata tgtcattgct aatcctcgca accttgagga aactgaggtg caaaggggtt 3060 
gaggaatgtg ttcaccatcc caaagccaac gaagtgccag aacccaaatc caagcttggc 3120 
tgtctcatcc ccaaggacgt actctttatg ccgttcctta tgacttttca tgcagcaggc 3180 
gcaaggcagc aagaatttgg ctcttctgac atttatcagg cacctggtat gtgccaggcc 3240 
tactgtaaac cccaagatga catggtccat gtctatccat tcagcatcag agagactgtt 3300 
atcaggactt gaggcctgag acacagccct gaatcagatg caggcctctg gtgtttgcaa 3360 
aggatgcgtg agtatgtgta tgtgtatgcg tgcattgtgt gtaggggtgt gtgtatgttg 3420 
ggggttgggg gcagaaaatg ctccctgaac tgttggtatt aaggcagatc tttgatggag 3480 
ggactggggt gtggagactt gaaccctagt ttcgaagtgg ggacctcgaa cactagtatg 3540 
gccagcagac aggctggagt tgctgaattg tcagctaggt ctttaatcca ctccccttct 3600 
ctaaatcagg gtggaattgg actcagggct ggttcagggc tgaggtttcc gggggatgaa 3660 
tgattccaga cagtgggaag gggtgatgga tgatgaacct ggcaagtcac aggactatgg 3720 
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acttggctgt atgttgcatg tatgtgtaag atctgagttg agacctttga tttttgccca 3780 
gaaagtccag ctaacatact atataagcca gtgagaccag ccagtgtctg ttcattgcct 3840 
ctgccttgcc cctatgcctg cccgcctcct atagcaggtc cctatagcat aatgaaaatg 3900 
cgaacaacct tttctctgga cactggggaa tctagtctgg tggataaaga tggatggctg 3960 
ttaaacgatc acatcttggg ctctgtgtgt tcagtgaagc ctcatttacc agatctctct 4020 
gatgagaaag gtgaatggtg gggacaaaat acatgctgtt atcagaagtc tggcgggggc 4080 
cctgtgcagt ggcaagcacc tgtagtcaca gccactcggg aggcagaggt gggaagatcg 4140 
cttgagccca ggaggttgag gctgcattga gctatgatgg caccattgta ctccagcctg 4200 
gttgacagag tgagaccctg tctaaaaaag gaaaagaagt ttagggggaa agcttttagg 4260 
aggtagatga tggttaattt taattccacc taacatttta aactttggta tgtcaaaagg 4320 
tgagaattgc cagtgtcaag gcatggaaac caaaggaaaa ccagagggtt gagaaggtgg 4380 
gaagggccgc agtgtggcag gcggcagttt ctgggatggc ttctactaaa ggaactcatc 4440 
atccaccacc ttcaggaagg aggctcctgg ggccctttaa gagagcagtt tcagtggagg 4500 
ctgcaggcag gttggcagaa acaagattat gagatggtct gtaccaggta tcaggcttga 4560 
gggttagtgc ctctagactt ctctgagaag tctagctttg aagggaaggc tggagagcta 4620 
gcacaatact caataataac aagacctaag gcacttactg tgtgccaggc actgttctaa 4 680 
gaactcagca cacttcagct catttaatcc ttacaactat acaaagtttg tagtcttttc 4740 
gttatacctt ttgacaggtg agttacttgc tggaattcac gtggctagta aatactggag 4800 
ctgagatttg aaagcaggca atctggctcc agaatctgtg ttttcaaccc acatactata 4860 
ttgaccatca tgaaaagatg gtggtgagca ggcttgtttc taaggatggg agggtcctca 4 920 
taatatttgt atttacagag aggagggaga ggtgggttac ttcctctccc aaccccagaa 4980 
agccaggcat gttggctggc atccctggga cctcagatat tcaagcaacc atggccttac 5040 
aaatcaggca ttctatttat tagagttgtg cagtgaggtc ccctccattt catccccaga 5100 
atatctccta gacaagttgg atcctgcagg gcccgagtat aaataagatg gtgccctctg 5160 
ttgttatgga gtagcttgga gctgggcccc agatgccctg tgtggattag atgagtgatt 5220 
agtatttggc aatgtagctt gagaggcttg ctgagttact ggggcatatg tccccagctc 5280 
acgagtacca gcctgacttc tgtcaatcaa tcagtgaaca gggtttgcag cgtccaccag 5340 
gtacctgtgg tgtagtagaa gagagtttga actttggaat caaacatgct gaagttcaga 5400 
ttccagctcc tccactaact acctctgtga tcttggctaa gtgaccgtca cttttttttt 5460 
tttttttttt gagacaagat ctcattctgt ctcctaggct ggagtgcagt ggcatgatca 5520 
tggctcactg cagcctcaac ctcccgaact caagagatca tcccacttca gcctcctgag 5580 
tagctgggac catggggcac ctgccatggc atcaagctaa ttttttatct tttgtaaaga 5640 
tgggatttcg ccatgttgcc taggccagtc tcaaactcct gggctcaagc gatccaccca 5700 
cctcagcctc ccaaagtgct gggattacag gtgtgagcca ccgtgcccat cctgatggtt 5760 
gctttcagag ccttggttct tcatctctaa aacaaagata ggaatactgc ctagtttcca 5820 
tggggagctt gtaacacatg cgggcctctc tcccctcacc ccagtctttg gttagacact 5880 
tactatttgc aaagacccaa tttcttcatc ataagaaggc tgccaccaca cttgagggag 5940 
tgacccattt acaaacaatt gcttccttct gggtcattct gatggattaa atgaagaacc 6000 
atttaaaacc acagcccaaa gggccagagg gagaggcagc agagggaaga tgcaggtgga 6060 
tcttcatgca ctgcatgagg aagctttgta ccatggtcag gaaaagctcc aggcacacaa 6120 
ggcagcctcc atggccaatc agagtgcacc tctcacctcg gcaactctgc taacctaaat 6180 
agaccagtca cacagcagag ctgcaaaaga agacatgata attacagaga ccccagggga 6240 
ccgtctgtgc ggactgatgc tttgtttctc ctgacccctc agacctctct atttcactgc 6300 
aaactctcaa tacccagggc ctttgttttg ctgtgaacac tttcattcct gagtacattt 6360 
tctatgagga gttggcggaa gagagtcaga atgaattata ttttcagcac acctctgggt 6420 
tcatgttgaa cttggctaga actgttttga cccaggaaac aatttgacta aaggatattt 6480 
caatccaaca aatactctct gggcccttcc aatgtttctg gcactggttt gatgctgagg 6540 
acatagagat gaataagaaa tggtctctgt cccccagctg ataaattatc attgatgatc 6600 
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taccgtgtgc cagggacttt acaaatgtga ttaccatttc ttacaacata gtaaggtaga 6660 
tagcattttt gtgttacagg tgaggagtag gaagttcaaa aagtttataa ggtcaaagtc 6720 
acacaactag tcagatgtag aactaacctt tggatttggg actgttcacc atcaaatcct 6780 
atatgctttt gatacagaca ggagacccag aaatactggg tagaagaggg tggttccctg 6840 
gcaaaggccc taccctcaaa cctggaaacc tgcagcccta aatgggaaga ggcattcctg 6900 
ttttcatgcc caaaagttgc cttttggccc gtcatgtccc ctatcctata cctgtataaa 6960 
ccccagaccc caggctccag aagcagatga ggagacaaac agaagagcag aagaatggca 7020 
gaatggcaca gcagagagaa gaaaaggaac atctgaatgt tgagaggaat ttggctgggg 7080 
gcggttggag aggagattag ccactggaca gccaaactcc aggggaagat catcttccca 7140 
ctccatcccc cttccagctc cccatccatc ctgctgagag ccacctccac cactcagtga 7200 
aaccactgca ttcatccttc aagtttatgt gtgacctgat tcttcctgga cattggacaa 7260 
agacctaggt ccagtgaact gtctaacact taagtcatcc acggacagca aggctaacag 7320 
aatgctgtaa catgtgccca cttggactct gggagttgca gacacccacc cgtaggtgct 7380 
gccatgggtc cggagcccag aaagtgcttg ccctggctcc tgcacctgcc cgtctgcatg 7440 
ctccccctcc ggtaaggggt ttgaggatgt ggcggctgaa cataagagct atgcccctgt 7500 
cacacatcat gggacagggg tcagggaact ctcccatttc actttcaaat attccaggtc 7560 
ccctttttgt tcgggagcca gagtctagtg gagacagaga tagaaaccct gaaataggat 7620 
ccaggcaata taaacctaca cgctgtaagc caagtcagag tgaggctgat gtgataacaa 7 680 
aagtctgaat acaaggtggg agacaattat tctgactgaa agggaggcaa gagttctgcc 7740 
ataggcttga tgctgtgtgt ccctggcttc ctaactcatt tgcaaatgga aaatactatt 7800 
ccattgtagg aaccttcttt ttccatttgt ctggggaagg agaaagaatg gctgggctga 7860 
atgaatttat cccttggtcc tcttccaaca aagctcagct atgaaagata aatccagctc 7920 
tccccacccc ctctcagtgg agctggggag aaatcaaaag cccctctgcc aataatgaga 7980 
ccaaagtttg caagggcagg acgagcccgt gctaacagag aaagtgttgt ttcctcaatt 8040 
tggttttaga ctgtcttgtc ctatggggga gaaaagatct gcccttggga gaggtgccaa 8100 
ctttatagat ctattaataa aagaactggc aggcttacag ttcttgccaa tgaggaaact 8160 
tgaatgagag aagccaggct caaccttggc caacagactg gagcccatca ccctaacttc 8220 
accccgcttc tccttaccca accgtcaaag gctaggcagc acccacccag cagcttccac 8280 
ctggctgaag cctgcacctg cttcagacca agggttagat ggaaatttgg catgggaaga 834 0 
gagggctcac ctgtgggcag gatagactct atccaagaag gagaactgaa aaatgaaaac 8400 
ctatgagaca aggggtgatc ctgaaggcag gcaggagaaa gggctggagg gagaggcact 8 4 60 
ggggaatttt tcctggtgaa tactgaagtt actagatgtt ttgtcttgca aaactcaagg 8520 
gaaaactctc aaactctaat gtttgtctat tctgtgtcca aactgtcctt ttgaaacgga 8580 
cccacctttc agtaaagaaa cttgcattgg cctgcccagc c 8621 



<210> 47 

<211> 1040 

<212> DNA 

<213> Homo sapiens 

<400> 47 

gtgattcccc tgtctgaccg catttaagaa ggcaggcaac tggaaggctt cattactatg 60 

gctgctgcag cttttgatcc cctggggcca cttccagtct acctggtctc cgttagactg 120 

ggcatctatg aggatgaaca tcatagagtg tggatagttg caaatgtgga aacttctcac 180 

tcatcacatg gcaacaggag gagaacccat gtaactgtcc acttgtggaa gttaatcccc 240 

cagcaggtta ttcccttcaa ccccttgaat tacgactttc tacccacgac gtggaaatta 300 

gagtccagga acatatactg ggcaacagat gggactcact ggagattact ggatcattcc 360 
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cagttgggtg acacagagca actgatcctg atgctggtgt tagggtgact gactgatgga 420 
agctgctggg aaggtcttga tagcctggct ggctcctgct cctgtgccct cgggtagact 480 
ttgcacccac ccctctgttt gttcctcaag tttctagttc tacagcaact tctgtcatgg 540 
tcagacctac tttttacatg atctactttc tgtgagcctg gtttgggtcc tgaccgttct 600 
ccctagctgg agacaaactg catcttctct tgcccacttt gtgtctctgg ccctgacctt 660 
atagatggct tttcttaaga cccctcctcc attcaaactc cttccacctg ggcttctagt 720 
gcttccagtg gtaggagatt tgccccccat ttaccttagg atgtctacag gtgcctctgg 780 
aggaggtaca gatagggact ctatgccctg tgggcctttg acctatcact ggcaagaagt 840 
ttgcctggtg ctgggtagag gacactctgt gctcagtagc ttaaaggctc agtgtgagac 900 
ttacagagaa catctggagt acttgtccag gtgctcagtt ccttggctcc aggaagtgaa 960 
tctgacccta ggcctctccc accccaactt gtttttaact aattaaactt tactgttttt 1020 
acttaaaaaa aaaaaaaaaa 1040 



<210> 48 
<211> 1020 
<212> DNA 
<213> mouse 



<400> 48 

gtcattcccc tgtctggccg catttaagaa ggcaggcaac tggaaggcct cattactatg 60 

gctgctgcag gtttttatcc tccgaggcta ctcccacaag ttctgatctc cacaggaccg 120 

ggcttctatg aggatgaaca tcatagactg tggatggttg caaaactgga aacttgttct 180 

cattcaccat attgcaacaa gattgaaaca tgtgtaactg ttcacttgtg gcagatgacc 240 

agataccccc aggagcctgc tccctacaat cccatgaatt acaactttct acccatgacg 300 

tggagattag catccatgaa cacataccgg ggaacagatg ccatgcactg gagattactg 360 

aatcattccc aggttggtga cacagtgcaa ctgatcctga tgctggagta aaggtcttga 420 

tagcctggct ggctcctgcc cctgtgccct tggggtagac tttgcaccca cccctctgtt 4 80 

tgttcctcaa gtttctagtt ctacagcaac ttctgtcatg gtcagaccta ctttttacat 540 

gatctacttt ctatgagcct ggcctatctc tgtcctgacc gttctcccta gctggagaca 600 

aactgcatct tctcttgccc actttgtgcc tctggccctg accttataga tggcttttct 660 

taagacccct cctccattca aactcctcca cctgggcttc tagtgccttg tgtggcagga 720 

gatttgtccc catttacctt aggatgtcta caggtgcctc tggaggaggt acagataggg 780 

actctatgcc ctgtgggcct ctgacctatc actggcaaga agtttgcctg gtgctgggta 840 

gaggacactc agtgctcagt agcttgaagg cacagtggga gacttacaga gaacatctgg 900 

agtacttgtc caggtgctca gttccttggc tccaggaagt gaatctgacc ctaggcccct 960 

cccaccccaa cttgttttaa agtaattaaa ctttactgtt tttacttaaa aaaaaaaaaa 1020 



<210> 49 
<211> 1058 
<212> DNA 
<213> mouse 



<400> 49 

gtcctgcccc tctctgaacc tcatttaaga aggcacgcaa ccggacggcc tcattactat 60 

ggctgattca gttcattttc cctggatgcc attcccacct cgcttcctgg tctgcacaag 120 

agatgacatc tatgaggatg aaaatggtag acagtgggta gttgcaaaag tagaaacttc 180 



23 



tcgttcacca tatggcagca ggattgaaac atgtataact gtgcacttgc agcatatgac 240 
cacaatcccc caggagccta ctccccagca gcccattaat aacaactctc tccccacgat 300 
gtggagatta gagtccatga acacatacac gggaacagat gggacatact ggagattact 360 
ggatcattcc cagatgggcg acacattgca actgatcctg gacatagtaa tatgcgaggt 420 
tgactgaatg atggaagctg atgggaaggt cttgatagcc tggctggctc ctgcccctgt 480 
gcccttgggg tagactttgc acccacccct ctgtttgttc ctcaagtttc tagttctaca 540 
gcacctcctg ccatgattag acctgctttt tagattatct actttctgtg agcctggcct 600 
aactctgtcc tgaccgttct ccctagctgg agacaaactg catcttctct tgctcatttt 660 
gtgcctctgg ccctgacctt atagatggct tttcttaaga cccctcctcc attcaaactc 720 
ctccacctgg gcttctagtg ccttgtgtgg caggagattt gtccccattt accttaggat 780 
gtctacaggt gcctctggag gaggtacaga tagggactct atgccctgtg ggcctctgag 840 
ccatcactgg caagaagttc gactggtgct gggtagaggc cactcagtgc tcagtagctt 900 
gaaggcacag tgtgagactt acagagaaca tctggagtac ttgtccaggt gctcagttcc 960 
ttggctccag gaagtgaatc tgaccctagg cccctcccaa ccccaacttg tttttaacta 1020 
attaaacttt attgttttta cttaaaaaaa aaaaaaaa 1058 



<210> 50 
<211> 1040 
<212> DNA 
<213> mouse 

<400> 50 

gtcattcccc tgtctgaccg catttaagaa ggcaggcaac cagacagctt cattactatg 60 

gctgattcag ttcgttttcc ctgtatgcca ttcccacctt gcttcctggt ctgcacaaga 120 

gatgacatct atgaggatga acatggtaga cagtgggtag ctgcaaaagt ggaaacttct 180 

tctcattcac catattgcag caagattgaa acctgtgtaa ctgtccactt gtggcagatg 240 

accacactct tccaggagcc tagtcccgac tctctcaaga ctttcaactt tctacccagg 300 

acatggagat tagagtccag gaacacatac cggggagcag atgccatgca ctggagatta 360 

gtgaatcatt cccagtttta tggcacagag gaactggtcc tgatgctgga ttcaaggtaa 420 

ctgactaatg gaagctgctg ggaaggtctt gatagcctgg ctggctcctg cccctgtgcc 480 

cttggggcag actttgcacc cacctctctg cttgttcctc aagtttctag ttctacaaca 540 

cctcctgcca tggtcttttt acatgatctt ctttctgtga gcctggccta tctgtggctt 600 

gacggttctc cctagctgga ggcaaactgc atcttctctt gctcactttg tgcctttgac 660 

actgacctta tagatggctc ttaagacccc tcttctcttc aaaatcctac acctgggctt 720 

ntagtgcctt gtgtggcagg agatttgtcc ccatttacct taggagttct agaggtgccc 780 

ctaaaggagg tacagataga gactctatgc cgtgtgggcc tctgagccat cactggcaag 840 

aagttcgcct ggtgctgggt agaggacact cagtgctcag tagcttgaag gcacagtggg 900 

agacttacag agaacatctg gaggacttgt ccaggtgctc agttccttgg ctccaggaag 960 

tgatgctgac cctagacccc tcccaacccc aacttgtttt taactaatta aactttactg 1020 
tttttactta aaaaaaaaaa 1040 



<210> 51 
<211> 1050 
<212> DNA 
<213> mouse 
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<400> 51 

gtcattcccc tgtctgaccg catttaagaa ggcaggcaac cagacagcct cattactatg 60 

gctgctgtgt ctgttgatcc ccagaggcca ctcccagtcc tcctggtctc tgttagcctg 120 

ggcatctatg aggatgaaca tcatagagtg tggatagctg taaacgtgga aacttctcat 180 

tcatcacatg gcaacaggat tgaaacatgt gtaactgtgc acttgcagca tatgaccaca 240 

ctcccccagg agcctactcc ccagcagccc attaataaca actctctccc cacgatgtgg 300 

agattagagt ccaggaacac atacacggga acagatggga catactggag attactggat 360 

cattcccaga tgggcgacac agtgcaactg accctggaca taataatagg cgaggatgac 420 

tgaatgatgg gagctgctgg gaaggtcttg atagcctggc tggctcctgc ccctgtgccc 480 

tcggggtaga ctttgcaccc aaccctctgc ttgttcctca agtttctagt tctaccacac 540 

ctcctgccat gactagacct gctttttaga ttatctactt tctgtgagcc ttgcctatct 600 

ctgtcctgac cattctccct agctggagac aaactgcatc ttctcttgct cactttgtgc 660 

ctctgacact gaccttatag atggcttttc ttaagacccc tcctccattc aaactcctcc 720 

acctgggctt ctagtgcctt gtgtggcagg agatttgtcc ccatttacct taggatgtat 780 

acaggtgcct ctggaggagg tacagatagg gactctatgc cctgtgggcc tctgagccat 840 

cactggcaag aagttcgact ggtgctgggt agaggacact acgtgctcag tagcttgaag 900 

gcacagtggg agacttacag agaatatctg gagtacttgt ccaggtgctc agttccttgg 960 

ctccaggaag tgaatctgac cctaggcccc tcccaacccc aacttgtttt taactaatta 1020 
aactttactg tttttactta aaaaaaaaaa 1050 



<210> 52 
<211> 6921 
<212> DNA 
<213> mouse 

<400> 52 

gtcattcccc tgtctgaccg catttaagaa ggcaggcaac tggaaggctt cattactatg 60 

gctgctgcag cttttgatcc cctggggcca cttccagtct acctggtctc cgttagactg 120 

ggcatctatg aggatgaaca tcatagagtg tggatagttg caaatgtgga aacttctcac 180 

tcatcacatg gcaacagggt gagtttcagg gctccagaca ggctattccc caactcttac 240 

ccctgtgagc tagggtgtgt ttacgtgttt gtgtgtgtgt gtgttgtgtg taggtatgta 300 

cttttattac cgtgtcagaa aatagctgat acaggtctct gaatagaacc ttgtgtatac 360 

ctaggcccat accctggagc tcccttggag acctcacagg cccaactaat acacaggtgc 420 

agagccctag cattgagaac tccgttacag atagtcagag agaacatgct gagagagctt 480 

tgttaggctg ggctttgtgg gatgagtagg agttctacag gtcagaaaaa aaaaaaaaag 540 

cactctggtg ccaggcagat tccctggaag gagcaaatgt gagtgccagc actactttcc 600 

gcatttctgc aaatcataat tcatctgctg ctgttgctcc cagacttgag tggcctgaag 660 

aagtgtttga cagacaaggg atccttacac taggagctga ctacaggact tgggggaagt 720 

cctaatgggt tctgtgatct gcaggtgatg tagagaccca cactgctgga aacctgctct 780 

ggaccatggc tttagtccac agcaggacct tgggggctaa aacactctag tggaaagctc 840 

tgggcattaa aactgctccc cagggtcgtg gaaaagacta tgggccacca ccaccacccc 900 

cactgccctg ggggtgtgga ggacttgggg gcattaggtc tctttagctg ccagggtttt 960 

cctgcacacc caccccctgt cctaacctcc agcccagcag cccttccctg cagctcccct 1020 

gactccctga atacctcagc cattttggca atgggactct cttggtcccc ccctcccctc 1080 

tctccttgac ttacaaggcc cagatcaagg tcatgttcac tctggactct cccaaatgtc 1140 

tgtttttgcc tgtgttctcc tttttaattt actataaaac tcctctttct cctccccctt 1200 

ttctttcccc tccagtcact tatcccctct cctccccctc tcctccctac ctaggagcag 1260 
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ttctgccatc cttttccatt ccttgtttta tattcatctg gagaaaactt tgatatttta 1320 
aggatacatc agccattttt tggatttgag acccaacaag atagtaattc acctgtccag 1380 
tcatgagaaa aaaactcttc aagttgataa gtgttttgtt tttttaaaaa acttcaatat 1440 
ttatttactt attttagtat atgagtatac tgtagctgta tacatggtta tgagccttca 1500 
tgtagttgtt gggaattaga ttttaggacc tctgcctgct ctcctctgcc ctgcttgatc 1560 
taatctattt atggttgtaa ataggtacac tgtagctgtc ttcagatgaa ccagaagagg 1620 
gcatcagatc acattacagg tggttgtgag gcaccatgtg attgctggga tttgaactca 1680 
gtaccttcag aagagcagtc agtactctta ccacctgagc catcttgcca ggtccgtaag 1740 
agttttcttt aatcccatac caaataaagt gtaaaatagt ttttgaccaa atattttcag 1800 
taggaagaaa ggaaggaagg aaggaaggga agggaaggga aggaaggaaa ggaaagaaga 1860 
aatggaagga agaaaagaat gaaaggaagg gagatacaga aacgaattct gttagggtat 1920 
aaatgaaaga gctgcaaagg aaataatgtg tagtcaatta aattcaaaca aaacacatgt 1980 
ctcccagcca ttctgcttct gggctgggct tgtgttgaag cagccattgg tcccacgtgt 2040 
gcatagaggg gcaggtgcag caaccaaatg agctccttgt acaactgagt gtaaagggga 2100 
gggtgcaggg aagggtctga tggcatttgt aaaaggagga acctgaggat ccaggcagag 2160 
ctgtctctaa gggatacatc caccagaggt gcactgtgga gtgcctacac agagacagaa 2220 
agcacaatgg tgggagggag ggggagttcc aagaccctat ggttgccata ggtttggagc 2280 
aggtgagtgg acagaaagac tctgggatgg gatggggcca tctcaggacc ccccttacca 2340 
cccctgtgca ccctgaaatc acgacagtgg ccagggaaag gttccataga aatagaaaga 2400 
ttggagaaat ttaggcacag gtggccctga attatttccc taattaaggc aatcccccct 24 60 
ctgtctctct gtctgtcaac ctatagatta cagtactgag gactttgcaa catgctagca 2520 
aaatgttcta caactgaggc acattccagc cctcctgcta ctttttatgt tgcaacaggg 2580 
tgtcactaaa ttgcccatgt tggtcttcaa ctgtagtgca ggtgggtctg gaacttgtga 2640 
tcctcctgcc tcagcctcct gagtagtctg ggattccagg ttgcaacact aggcccagca 2700 
ttgagcaact ctttttcttc ttcctgcttt tcttcccttt cccactcttg gttagtatac 2760 
taagaaatag gttgcactgt tacattctct ctctctctct ctctctctct ctctctctct 2820 
ctctctctct cacacacaca cacacacaca cacacaccca gcagaaacag ggacatctaa 2880 
attcaagggc agattggcct ataaagccag ttgcagtcta ggcagggcta cacagagaga 2940 
tcctgtctca aataagcaca tagctagata gataaataaa tctctaaagc aaacataacc 3000 
ctgtgaacct gagttaggat ccctgaataa aaagtcaggt ttgactttat tccctcaact 3060 
tcagggctgt ggggagttta gtcatagagg agaaattgat aggacagata tcagagtcct 3120 
cctttggtcc tttcaggcat atatatacac acacacacac acacacacac acacatatat 3180 
atatatatat acatacacac acacacacac acacacacac acacatatat atatatatat 3240 
atagtacatg agtgcataca cacgcatgca cacatataca cccataaaca ctcatgcaca 3300 
gatacataca tatacccaca tacacacaca tgtacacaca catgtacaca tacacactca 3360 
tacacatata tacatacata cacatgcaca cacacagcac tcaaacatat acacccacaa 3420 
gcacactgtg tatgtcagag ttcaaacaga ccattgcttg cttggttgac agcaccttct 3480 
cactgggctt tttcaatcac cggcctccaa gagaagctta tctttctgca ggcactgagg 3540 
acctaggaac gcccttctgt gccaccttag ggcaaaagca tccgcagaca acagcagttt 3600 
tgaatgaaac tttatgtact acaaaagcag gtggcaagcc tacttgggcc cttgtccctt 3660 
ctttgctatc ccccttctgt ccaccctgac atcacaacac aagcagaagg gacagacttt 3720 
gagaccatag gatggcagcc aggaaaatga ctccccacag tgtgacctat caaagtgtgt 3780 
gtctttagct caacaaaaca aaggcatgga tcaggtagac aggaggacgc tggtaggaag 3840 
tctgtgacac actttgtggt ttgtggtcat gcttggtatt tggttatggc ttttaggacc 3900 
tctcttactt tgtgactttc tgtgttggat caggtttatt catcaataaa tggtacatga 3960 
agttcaccat caacagatca atggatttct tttagtaaag ccattgcccc tggtgtctgt 4020 
gtgtgttctg tgtgtttgcg tacatgactg ttaatgtgtg catgtgtgtg tgtttgtctg 4080 
aaggtgtttt gtgtgtttgt gtatgtacct gttcatgtgt gcatgtgtgt gtgtgtgtgt 4140 
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gtgtatacaa gagtgtgcat ttccatgtgt gtgtgtaaaa cattcaggtg agctgccctg 4200 
tgactttctt attcccttga taaaacatcc ttccctaaat ctgcagtacc agtggcagcc 4260 
agccaacccc agtgatcctc ctgcctcagc tccccaagca ctggtgcaaa tataactgta 4320 
gctttttact tgggtgctgg tgattcaaac ttaggggctc ctgcctgtgc ggcaggtgct 4380 
cttatctatc aaggcatctc cccactctct gtcttacttt tctgaagaga aatttgataa 4440 
ctctacaaat ggccaagaca aagtccaaag gggcgattta gagcctgtga aaggcactga 4500 
cagcctggag acagtggcta ctttgtggga gatactctgg ggttaagcag gggtggtgga 4560 
agagagtgcc agagaggcag aagagcagca gacaccccag gccctgagga aaaagaccta 4620 
ggagcagcaa acgagcaagt caatagcatg gctgatggaa ccctgctcct aatgtactgc 4 680 
tctctctatt cccttcagag gagaacccat gtaactgtcc acttgtggaa gttaatcccc 4740 
cagcaggtta ttcccttcaa ccccttgaat tacgactttc tacccacgac gtggaaatta 4800 
gagtccagga acatatactg ggcaacagat gggactcact ggagattact ggatcattcc 4860 
caggcaagtg tttggtaccg caagataaaa gctgcagcag gtgtcccatg cttttcattc 4920 
tgacccaccc ttgggaggtt cttctccctc tcttccccac agtctagccc ttggtccatt 4 980 
tttagtggac agttgggaag tatggtatga gaaaactttt cttggggaga tgagacaact 5040 
caagtaccca gcatagacta ggaacccaca acagatcaaa gtatgcacag cacagaagcc 5100 
caacttggtg aaccagtgaa ctttactggg ttgacataca ggagcaccag tgtggggtca 5160 
ctcacaggac cagaaatgat ccagttggat gcatcaccaa agcccaccac aaccttggtg 5220 
acaaacagct cacaaaagct gggaccttga agcacagtac ccagcttaca gtcagctaac 5280 
aggtgggaga gtgccctttc caggtgcctc tgctggtcta aagctcctct aggcagcttg 5340 
gctggtcagc ttcttctggg tccttttgtc tgagtttgag gcttgactct gctgcttctg 5400 
gtagggaggg acggtgtgat tctgctcagt ttcaggaagt tcttgaagct actattgttg 54 60 
ttgttatttc cctgcttcag ggtgttactg tattgcaaca gagggagaca gggagaggga 5520 
gaaaggagag agggagggta cagagaggga ggaggggaag aaaagagaag atagagagag 5580 
aatgcacatg gggacacttc tgcctactgg tgtaggtgtg aaagtaatgc ttccctctgg 5640 
ctacagaatc cagagaccag gtcccctttt gctacttgtc ccttcctgat gcctcaggtc 5700 
tgaatgggtc actagggctg gctagacttg ccttacagga ccctgatggc tgtaccctct 5760 
cctacagttg ggtgacacag agcaactgat cctgatgctg gtgttagggt gactgactga 5820 
tggaagctgc tgggaaggtt tgtcagggtc agggaccacg accctcactg gtgaggtcag 5880 
acagtgagcc gctgactttc cctctttata ggaagacagt gctggcatct ctcccaagct 5940 
ttcctggtgt aacacacaca ggaagcacag gacatcacct gaatgaccta aagtcaccgc 6000 
tcacccctgg gggaaaaaac ctgaatgttc tgtgttccac atcatagcta agcaaaagag 6060 
cactgggtat tggaattgtg ggtcctatga tctcagagcc ctatgatcac actagaatcc 6120 
agtgactctg gatctcaagt ttaatcatac agtactgtgt gatcctggcc acaggggact 6180 
tacactgggc atctcctcca acttggtagc agctctctct ggctaggtag agcttggacc 6240 
atgggcaggg ctggctggag cccaagcctt tggatctagg acccagttct gcagggatcc 6300 
actaatcagt gtctcatttg ccttctacag gtcttgatag cctggctggc tcctgctcct 6360 
gtgccctcgg gtagactttg cacccacccc tctgtttgtt cctcaagttt ctagttctac 6420 
agcaacttct gtcatggtca gacctacttt ttacatgatc tactttctgt gagcctggtt 6480 
tgggtcctga ccgttctccc tagctggaga caaactgcat cttctcttgc ccactttgtg 6540 
tctctggccc tgaccttata gatggctttt cttaagaccc ctcctccatt caaactcctt 6600 
ccacctgggc ttctagtgct tccagtggta ggagatttgc cccccattta ccttaggatg 6660 
tctacaggtg cctctggagg aggtacagat agggactcta tgccctgtgg gcctttgacc 6720 
tatcactggc aagaagtttg cctggtgctg ggtagaggac actctgtgct cagtagctta 6780 
aaggctcagt gtgagactta cagagaacat ctggagtact tgtccaggtg ctcagttcct 6840 
tggctccagg aagtgaatct gaccctaggc ctctcccacc ccaacttgtt tttaactaat 6900 
taaactttac tgtttttact t 6921 
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<210> 53 
<211> 8196 
<212> DNA 
<213> mouse 



<400> 53 

gtcattcccc tgtctggccg catttaagaa ggcaggcaac tggaaggcct cattactatg 60 
gctgctgcag gtttttatcc tccgaggcta ctcccacaag ttctgatctc cacaggaccg 120 
ggcttctatg aggatgaaca tcatagactg tggatggttg caaaactgga aacttgttct 180 
cattcaccat attgcaacaa ggtgagtttc agtgctccag acaggcgatc ccccaactct 240 
tacccttgtg tgctaggctc tgttgtgtgt gtgtgtgtgt gtttaggtag gtgctgttat 300 
tatcgtgtca gaatagatga tacaggtaat tgaatagaaa catgtcccta taccctggag 360 
cttgcttgga ggcttcacag gtccaactaa tagacagatg aagagcccta gcagtgagga 420 
cgctgttaca gatagagaga acatgctgag aagggtttgg taagctgggc tttgtgggat 480 
gagtaggagt tctacaggtc agaaaaaaac aaaacaaaac aaaacaaaca aataaccaac 540 
tctggtgcca ggcagatttc ctggaaggag caaatgtaag tgccagcact actttccgca 600 
tttctgcaaa tcataattca tctgctgctg ttgctcccag gcttgagtgg cctgaagaag 660 
tgtttgacag agaagggatc cttacactag gagctgacta cgggacttgg gggaaagtcc 720 
taatgggttc tgtgatctgc aggtgatgta gagacccaca ctgctggaaa cctgctctgg 780 
accatggttt tagtccacag caggtccttg ggggctaaaa cactctctaa tggaaggctc 840 
tgggtatcaa aactgctccc cagggttgtg gaaaagacta tgggccccct cccctggcct 900 
ggaggtggag agaacttggg ggccttaagt ctctttggct gccagggttt tcctgcaacc 960 
ccaacccctg tcctaacctt ttccagccca gtagcccttc cctgcagctc ccctgactcc 1020 
ctgaatactt cagccatttt ggcaatgggg ctctcttggt tcccccctcc cctccctcct 1080 
tgacttgcaa ggcccagatc aaggtcatgt tcactctgga ctctcccaaa tgtctgtttt 1140 
tgcctatgtt cttcctttta tctactatga acctcctctt tctcctcccc cctcctcctc 1200 
cccatcctct cacttctccc cctcctcctc agcctcctca cttctccccc tccgccaaac 1260 
ccttctcccc ctcctcttcc tgtacctcct ccataccaag gagcatttat ggcttcattt 1320 
ttcattcctt gttttatatt catctaaaga tgactttgat attttaatag acacctcagc 1380 
catgtctttg gtttgagacc taacaacagg gtaattcacc tgttcaatta tatatatata 1440 
tatatatata tatatatata tatatatata tatatatatc ttcaagttga taagtgtttt 1500 
ctttttttct cttttcatta aaaaaactcc aatatttatt tatttatttt aatatatgag 1560 
tacactgtac tgtatacatg gttgtgagcc ttcatatggt tgtttggaat tggattttag 1620 
gacctctgcc tgctctcatc tgccctgctt gctctgatct atttattatt ataaataggt 1680 
acacggtagc tgtcttcaga tgaaccagaa taggacgtca gatctcatta caggtggttg 1740 
tgaagcacca tttggttact gggatttgaa cttagttcct ttggaagagc agtcagtgct 1800 
cttacccact gagctatctt gccagcccca ataagagttt tccttaaccc tatactatac 1860 
aaagtgtaga atagttttga accaaatatt ttcagtagaa acaaagaaag aaagaaaaga 1920 
aggaaggaag gaaggaagga aggaaggaaa aaagaaagaa agaaagaaag gaaggaagga 1980 
agaaaggaag aaaggaagaa aggacgaaag gacaaaagga agaaaggacg aaaggaagaa 2040 
aggacaaaag gaagaaagga cgaaaggaag aaaggacgaa aggaagaaag gatgaaagga 2100 
agaaagaaag aaagaaagaa agaaagaaag aaagaaaaaa aagagaaaat amnaaaaaga 2160 
gagagaaaag gnmggaagaa aagaagaacg gaaggaagat ccagaaaaga attctgttag 2220 
ggtataaatg aaatgagctg caaaggaaat aatgtgtaat caattaaatt caaatagaat 2280 
nacatgtctc ccagccattc agctctgggc tgggcttgtg ttgaagcagc ccattggtcc 2340 
cacgtgtgca tagaggggca ggtgcagcaa ccaaatgagc tccttgtaca actgagtgta 2400 
aaggggaggg ggcagggaag ggtctgatgg catttgtaaa aggaggaacc tgaggatgca 24 60 
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ggcagagctg tctntaaggg atacatccac cagaggtgca ctgtggggtg ccttcccaga 2520 
gacagaaagc acaatggtgg gagggagggg gagttccaag accctatggt tgccataggt 2580 
ttggagcagg tgagtggaca gaaagactct gggatgggat ggggccatct caggaccccc 2640 
cttaccacct ctgtgcaccc tgaaatcacc acagtggcca gggaaaggtt ccataggggt 2700 
agaaagattg gagaaattta ggcacaggtg gccctgaatt atttccctaa ttaaggcaat 2760 
cccctctctg tctctctgtc tgtcaaccta tagattacag taccgaggac tttgcaacat 2820 
gctagcaaaa tgttctacaa ctgaggcaca ttccagccct cctgctactt tttatgttgc 2880 
aacagggtgt cactaaattg cccatgttgg tcttcaactg tagtgcaggt gggtctggaa 2940 
cttgtgatcc tcctgcctca gcctcctgag tagtctggga ttccaggttg caactctagg 3000 
cccagcattg agcaactctt tttcttcttc ctgcttttct tccctttccc actcttggtt 3060 
agtatactaa gaaataggtt gcactgttac attctctctc tctntctctc tctctctctc 3120 
tctntctcwc acacacacac acacacacac acacacacac mcasccagaa acagggacat 3180 
ctaaattcaa gggcagattg gcctataaag ccagttgcag tctaggcagg gctacacaga 3240 
gagatcctgt ctcaaataag cacatagcta gatagataaa taaatctcta aagcaaacat 3300 
aaccctgtga acctgagtta ggaccccaga ataaaaagtc aggtgtgact ttgtttgctg 3360 
aacttcatgg ctgtggggag tttagtcaca caggaaaaag tgataggaca gacatctgag 3420 
tcctcctttg gccttttcag gctggtatat atacatatac tgtatgtgag tccacagaca 3480 
cacacccata tacatgcata cacacgtaca tacacatgta cacacacacc catacacaca 3540 
caagcacaca cacatatgtg cacacataca caccacacac acacatacac gaatacacac 3600 
acaggcatac acacgtgcac atataccctc atacacacac acaggcacac acacatgcac 3660 
acatacccac acatacacac ccatatacac acatgcacat acacatacat gtgcacacat 3720 
gtgcacacac agaaacacac acaaaaacat gcacacatgc acacacatat gcacacacaa 3780 
atgcacgcac acacacacac acacatatgt ggatcaattg gctatataca gacagcttgg 3840 
tctccagtgg agttgagaga tgaaaacctg ctgaaaccca gtgttggtca ttttcaccta 3900 
catgggacag aaggcattcc cccatctcct ggacccctgg ctcctgtcga agttaccacc 3960 
cccacaaccc ccacaggaga ggctggtggc tttcagtctt gtagacaatg ccccaagctt 4020 
ctgcccttta ggctagactc ctcccagtta cctagcaaca acaggataac aaggggctgt 4080 
ttggccccac ctcactctct ccctcttact ctctaatctc ttactctcta agctttacct 4140 
ctctctttaa gctttcttag tctctagcct ttattttctc tctccttttc tcctttatct 4200 
cctcttggcc atggtcagtc tctctctttt accttttctc aatccccctg cctttctaca 4260 
ataaagctct aaaagcatag actgtctctg ttcatcaagg accagagtta aaactctcgc 4320 
ctgcgtggga acctctctct ctcttcccac tttctgtatc tccaaggcct ggtgctgctc 4380 
cggggcctct attctgttct tatctccttt gcactatcca gcgtgggata cagaagactg 4440 
agaacttagt atctgtgact gccccttgtc cacccctggc agtgtggagt cagtggcttc 4500 
acacagccag atgcccaccc aggcccaagt ggaaagcatc cgttagtcct ccctcatcta 4560 
cctgcccaga gcacaggaac tctggccaaa cacaagctct ctccctgccc cctcattccc 4 620 
ctacacccac agcaccacac acacacacac acacacacac acacacacgg tgtatgtaag 4680 
agttcacaca gtctgttgct agcctggttg acagcacctt ctcactgggc tttttcaatc 4740 
acaggcctcc aagagatgct tatatttctg caggcactga ggacctagga aggctcccct 4 800 
gtgccacctt agggcaaaag catccagaga caacagcagt tttgaatgaa actttatgta 48 60 
ctacaaaagc angtggcaag cctacttggg gcccttgccc cttctttgct atcccccttc 4 920 
tgtccaccct gacatcacaa cacaagcaga aaggacagac tttgagacca taggatggca 4980 
gccaggaaaa tgactcccca caatgtgacc tatcaaagtg tgtgtcttta gcaaaacaaa 5040 
ggcatggatc aggaagacaa gaggacgctg gtaggaagtc tgtgacacac tttgtggttt 5100 
gtggttatga ttggtacttg gttatggatt ttaggaactc ttttactttg aggatttctg 5160 
tgtgggctaa gnawtntttt ttacatctat aaatggtgca tgaagttcac catcaacaga 5220 
tcaatggatt tcttgtttaa agccattgcc ccctggttct ttgtgtgtgt gtatttctgt 5280 
gtgtgtgtgt gtgtgtgtcc tgtatgtttg tgtncatacc tgttcatgtg tgcatgtgtg 5340 
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tgtgtataca agtgtatgca tttgcatatg tgtgtgtgtg tgcgcgcgtg cacgcgcgtc 5400 
tgtgtaacat tcaggtgagc tgccctgtga ctttcttatt cccttgataa aacatccttc 5460 
cctaaatctg cagtaccagt ggcaagccaa cccaacccca gtggtccttc ytgcctmagc 5520 
tcccccaagc actkggtgca aatataactg wagctttttt acttgggtgn tggtnnknna 5580 
atcttagggg ctcctgcctg tgcggcaggt gctcttatct accgaggcat ctccccactc 5640 
tctgtcttac ttttcagaag agatttttga aaactcttaa aatggccaag acaaggtcag 5700 
aaggggccat ttagtgccat gaaaggcatt acctggagcc tggaggcagt ggctactttg 57 60 
tgggagatac cctagtgttg aacagggttg gtggaagaga gtgccagaga ggcagaagag 5820 
cagcagacac cccaggcnct gaggaagaag acctaggtgc tggagacaag gctgtggtta 5880 
gsggtgttga ccacngcagc aaacaaccca gtctcaggca tggaggatgg aaccctgctc 5940 
ctaatgtacc gctctctcta ttcccttcag attgaaacat gtgtaactgt tcacttgtgg 6000 
cagatgacca gataccccca ggagcctgct ccctacaatc ccatgaatta caactttcta 6060 
cccatgacgt ggagattagc atccatgaac acataccggg gaacagatgc catgcactgg 6120 
agattactga atcattccca ggcaagtgtt tgctacctca agataaaagc tgcagcaggt 6180 
gtccagtgct tttcattctg acccacccgt gggaggttct tctccccctc ttccccacag 6240 
tcaagccctt ggtccwtttw agtggatggt tagggtctgt cgcatgacaa accttttctt 6300 
ggggagatga gccaaactca agtacccagc acagattggg aacccacaac agaccaaagt 6360 
atgcacagca ctgaagccca acttggtgaa ccagagaact ttactgtggt tacatacagg 6420 
agcaccagag aggggtcact cacaggagca gaaatgagtc agttggctgc atcacgaaag 6480 
cccaccacag acttgttgac agagagttca aaaaagctga tgccttggat cacagtgccc 6540 
agcctacggt cagctaacag gttggagagt gcctttgagg tgcctccgct ggtccacgag 6600 
ccatttcagg cagcttggct gatcagcttc ttcagagttc ttttgtctga gtctttgagg 6660 
cttgactctg ctgcttcagg cagggagggg cagtgtgatt ctgctcagtt tcaggggagt 6720 
tcttgaagct acttttgttg ttgttacttc cctgcttcag gaagtgactg ctatgaaaga 6780 
aagggtcaga gggaaggaga gagggggcag agagggagag gaagggggaa agatagaagg 684 0 
tagagagtga atgcccatgg ggacacttct gcctactggt gtaggtgtga aagtaacgct 6900 
tccctctgac tacacaatcc agagaccagg tccccttttg ctccttgtcc ctttggctgc 6960 
ctcaggtctg aatgggcaac tagggctggc tagatttgcc ttacaggacc ctgatggcta 7020 
ttccctctcc tgtaggttgg tgacacagtg caactgatcc tgatgctgga gtaaaggtga 7080 
ctgactgatg ggagctcctg ggaaggtttg tcaggtcagg gaccacgacc ctcacaggtg 714 0 
aggtcacaca ggtagccgct gaatttccct ctttattgga agacagtgct ggcatctccc 7200 
tccgagcttt cctagtgtaa cacacacagg aagcacagga cttctctggg atgacacaaa 7260 
gtcactgctc agacctgggt aaaaaaaatc tgtatgctca gtgttccaca tcatagctaa 7320 
accaaagagc ttaattgggt gttggaattg tgggtcctat gacctcagag ccctatgatc 7380 
acactagaat ccagtgactc tggctctcaa gtttaatcat aaactgctgt gtgatcctgg 7440 
ctccagagga cctacactag gcatctccac cagccgtgta gcagctctac ctggctgtgt 7500 
agagcttgga ccatgggcag ggctggctgg agcccaagcc tttggatcta ggacccagtt 7560 
ctgcagcgat ccactaatca gtgtctcatt tgccttctac aggtcttgat agcctggctg 7 620 
gctcctgccc ctgtgccctt ggggtagact ttgcacccac ccctctgttt gttcctcaag 7 680 
tttctagttc tacagcaact tctgtcatgg tcagacctac tttttacatg atctactttc 7740 
tatgagcctg gcctatctct gtcctgaccg ttctccctag ctggagacaa actgcatctt 7800 
ctcttgccca ctttgtgcct ctggccctga ccttatagat ggcttttctt aagacccctc 7860 
ctccattcaa actcctccac ctgggcttct agtgccttgt gtggcaggag atttgtcccc 7 920 
atttacctta ggatgtctac aggtgcctct ggaggaggta cagataggga ctctatgccc 7 980 
tgtgggcctc tgacctatca ctggcaagaa gtttgcctgg tgctgggtag aggacactca 8040 
gtgctcagta gcttgaaggc acagtgggag acttacagag aacatctgga gtacttgtcc 8100 
aggtgctcag ttccttggct ccaggaagtg aatctgaccc taggcccctc ccaccccaac 8160 
ttgttttaaa gtaattaaac tttactgttt ttactt 8196 
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<210> 54 
<211> 4567 
<212> DNA 
<213> mouse 



<400> 54 

gtcctgcccc tctctgaacc tcatttaaga aggcacgcaa ccggacggcc tcattactat 60 
ggctgattca gttcattttc cctggatgcc attcccacct cgcttcctgg tctgcacaag 120 
agatgacatc tatgaggatg aaaatggtag acagtgggta gttgcaaaag tagaaacttc 180 
tcgttcacca tatggcagca gggtgagttt cagggctcca gtcaggttac cccctaactc 240 
ttacctttgt gtgctaggct gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 300 
gtgtttgtgt ttaggtaggt gctattattc tcactcatca tatggcaaca gggtgagttt 360 
cagggctcca gacaggctat ttcccaactc ttacccttgt gagctagggt gtgtttacgt 420 
gtttgtgtgt gtgtgtgtgt gtgtgtgttg tgtgtaggta tgtactttta ttactgtgtc 480 
agaaaatagc tgatacaggt ctctgaatag aaccttgtct atagctaggc ctataccctg 540 
gagcttgctt agaggcctca caggtccaac taatacacag atgaagagcc ctagcagtaa 600 
ggactccgtt acagatagtc agagagaaca tgctgagaaa gctttgttaa aaaaaagaaa 660 
gcaaaaaaaa aaaaaaaaaa aaaaaaagct gggctttgtg gaatggtagg aattctacag 720 
gtcattaaaa aaacaaaacg aacaaataac aaactctggt gccaggcaga ttccctggaa 780 
ggagcaaatg tgagtgccag cactagtttc cgaattcatg taaatcataa ttcatctgct 840 
gctgttgctc ccaggcttga gtggcctgaa gaagtgtttg acagacaagg gatccttaca 900 
ctaggagctg actacaggac ttgggggaag tcctaatggg ttctgtgatc tgcaggtgat 960 
gtagagaccc acactgctgg aaacctgctc tggaccatgg ctttagtcca cagcaggacc 1020 
ttgggggcta aaacactctc taatgaaggc tctgggtatg aaaactgctc cccagggttg 1080 
tggaaaagac tatgggctcc ctcctctgct ctgggggtga ggagggagtg gggatgttag 1140 
gactgtttta ctgccaaggt tttcctgcac cccatgccct gtcctacctt ctccttctcc 1200 
agcccagcag cccttccctg cagtttcctt gacttccaca atacctcagc cattttgttg 1260 
atgtaagttg cttggccccc tctgccttgc ccgttctttg ctatccccct tctgtccacc 1320 
ctgacatcac aacacaagta gaagggacaa actttgcaga ccttaggatg gcagccagga 1380 
aaaaaactct ccacagtgtg acctatcaat gtgtgtgtct ttagcttaac aaaacaaagg 1440 
tatggatcag gtagacagga ggacgctggt aggaagtctg tgacacactt tgtgatttgt 1500 
ggtcatgctt ggcacttggt tatggttttt aggaacgctt ttactttgtg gctttctgtg 1560 
tgggctcagg tttacttttt gcttccatga atggtacatg aagttcacca tcaacccttc 1620 
agtggatttc ttttggtaaa gccattgccc cctggtgtgt gtgcatgtgt ctgtgtgtgt 1680 
gtttattctg tgtgtttgtg tgcatgcctg ttcatgtgtg gatgtgtgtt ctgtgtgttt 1740 
gtgtacatac ttgatcatgt gtgcatatgt gtgtatgtat acaagtgtgt gcatttgcat 1800 
gtgtgtgtgt ctgtgtaaca ttcaggtgag ctgccctgtg actttcttat tcccttgata 1860 
aaacatcctt ccctaaatct gcagttacag tggcagccag ccaaccccag tggtcctcct 1920 
gcctcagctc cccaagcact ggtgcaaata taactgtagc tttttacttg ggtgctggtg 1980 
attcaatctt aggggctcct gcctgtgcgg caggtgctct tatctaccga ggcatctccc 2040 
cactctctgt cttacttttc agaagagaat tttgaaaact ctacaaatgg ccaagacaga 2100 
gccccaaggg gccatttagt gccatgaaag acactgctag cctggagaca gtggctactt 2160 
tgtgggacat accctggggt tgagcagggt tggtggaaga gagtgccaga gaggcagaag 2220 
agcagcagac accccaggcc ctgaggaaga agacctaggt gctggagaca aggctctggt 2280 
tagggatgtt gaccactgca gcaaacaagc taggcccagg catggaggat ggaaccctgc 2340 
tcctaatgta ccgctctctc tgttcccttc agattgaaac atgtataact gtgcacttgc 2400 
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agcatatgac cacaatcccc caggagccta ctccccagca gcccattaat aacaactctc 2460 

tccccacgat gtggagatta gagtccatga acacatacac gggaacagat gggacatact 2520 

ggagattact ggatcattcc caggcaagtg ttggttacct caagataaaa gctgcaggag 2580 

gtatcccatg cttttcattc tgacccaccc ctgggaggga ggttctccct ctcttccccc 2640 

aactctagct cttggtccat ttttagtgga tgattggaac ctatgatatg acaaaacttt 2700 

tatggggaga tgagacaaac tcaagtacct agcacagatt gggaacccac aacagaccaa 2760 

agtatgcaca gaacagaagc ccaacgtagt gaccagtgaa ctttactggg attacataca 2820 

ggaacaccag tgaggggtcg ctcacaggag ctgaaatgac tcagttgtgt gcatcaacaa 2880 

agaacttaat gcatggtgac tgacagttca caaaaacagg gaccttgaag cacagtgtcc 2940 

aggctacagt cagctaacag gtgggagagt gccctttcca ggtgcctctg ctggtctaaa 3000 

gcttttccag gcagctcggc tggttggatt cttctgggtc cttttgtctg agtcttttag 3060 

gcttgactct gctgcttctg gtagggagga gcagtgtgat tctgctcagt ttcagggagt 3120 

tcttgaagct acttttgttg ttgttacttc cctgcttcag gaagtgactg ctatgaaaga 3180 

aagggagaaa gggaaggaga gagaaggtaa agtgagaata cacatgggga cacttctgcc 3240 

tactggtgta ggtgtgaaag taatgcttcc ctctggctac agaatcgaga gaccaggtcc 3300 

ccttttgctc cttgtccctt cctgatgcct caggtctgaa tgggtcagta gggctggcta 3360 

gacttgcctt acaggactct gatggctatt ccctctctta cagatgggcg acacattgca 3420 

actgatcctg gacatagtaa tatgcgaggt tgactgaatg atggaagctg atgggaaggt 3480 

ttgtcagggt cagggaccac aaccgtcact ggtgaggtca cacaggtagc cgctgacttt 3540 

ccctctttat aggaagccag tgctggcatc tctccctgag ctttcctggt gtaacacaca 3600 

caggaagcac aggacctcac ggagatgacc taaagtcacc gttcagccct ggggggaaaa 3660 

acctgaatgt tctgtgttcc acatgatagc taagtaaaag agcactgggt gttggaattg 3720 

tgggtcctat gatctcagag ccctatgatc acactagaat ccagtgactc tggatctcaa 3780 

gtttaatcat acagtactgt gtgatcctgg ccacagggga cttacactgg gcatctcctc 3840 

caacttggta gcagctctct ctggctaggt agagcttgga ccatgggcag ggctggctgg 3900 

agcccaagcc tttggatcta ggacccagtt ctgcagcgat ccactaatca gtgtctcatt 3960 

tgccttctac aggtcttgat agcctggctg gctcctgccc ctgtgccctt ggggtagact 4020 

ttgcacccac ccctctgttt gttcctcaag tttctagttc tacagcacct cctgccatga 4080 

ttagacctgc tttttagatt atctactttc tgtgagcctg gcctaactct gtcctgaccg 4140 

ttctccctag ctggagacaa actgcatctt ctcttgctca ttttgtgcct ctggccctga 4200 

ccttatagat ggcttttctt aagacccctc ctccattcaa actcctccac ctgggcttct 4260 

agtgccttgt gtggcaggag atttgtcccc atttacctta ggatgtctac aggtgcctct 4320 

ggaggaggta cagataggga ctctatgccc tgtgggcctc tgagccatca ctggcaagaa 4380 

gttcgactgg tgctgggtag aggccactca gtgctcagta gcttgaaggc acagtgtgag 4 440 

acttacagag aacatctgga gtacttgtcc aggtgctcag ttccttggct ccaggaagtg 4500 

aatctgaccc taggcccctc ccaaccccaa cttgttttta actaattaaa ctttattgtt 4560 

tttactt 4567 



<210> 55 
<211> 4570 
<212> DNA 
<213> mouse 

<400> 55 

gtcattcccc tgtctgaccg catttaagaa ggcaggcaac cagacagctt cattactatg 60 
gctgattcag ttcgttttcc ctgtatgcca ttcccacctt gcttcctggt ctgcacaaga 120 
gatgacatct atgaggatga acatggtaga cagtgggtag ctgcaaaagt ggaaacttct 180 
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tctcattcac catattgcag caaggtgagt ttcagggctc cagtcaggtt acctttgtgt 240 
gctaggctgg gtgtgtgtgt gtgtgtgttg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 300 
tgtgtgtgtt taggtaggtg ctattattac cgtgtcagaa aatagctgat acaggtctct 360 
gaatagaacc ttgtctttac ctaggcccat accctggagc tcgcttggag gcctcacagg 420 
tccaactaat agacagatga agagccctag cagtgaggac tctgtcacca acagtcagag 480 
aaaacatgct gagaaagccc tattaggctg ggctttgtgg gatgagtagg agttctacag 540 
gtcagaaaaa agaaagaaag aaagaaagaa agaaagaaag aaagaaagaa agaaagaaag 600 
aaagaaaatc aggtgcaagg cagattgagt ccctggaagg agcaaatgtg agtgccagca 660 
ctagtttccg aattcctgta aatcataatt catctgctgc tgttgctccc aggcttgagt 720 
ggcctgaaga agtgtttgac agacaaggga tccttacact aggagctgac tacaggactt 780 
gggggaagtc ctaatgggtt ctgtgatctg caggtgatgt agagacccat gctgctggaa 840 
acctgctctg gaccatggct ttagtccaca gcaggacctt gggggctaaa acactctcta 900 
atggaaggct ctgggcatca aaactgctcc ccagggttgt ggaaaagact atgggctccc 960 
tcctctgctc tgggggtgag gagggagtgg ggatgttagg actgtttgac tgccaaggtt 1020 
ttcctgcacc ccatgccctg tcctaccttc tccttctcct gcccagcagc ccttccctgc 1080 
agttcccttg atttccacaa tacctcaacc attttgttga tgtaagttgc ttggccccct 1140 
ctgccttgcc ygttctttgc tatccccctt ctgtccaccc tgacatcama acacaagyag 1200 
aagggacaaa ctttgcagac cttaggatgg cagccaggaa aaaaactctc cacagtgtga 1260 
cctatcaatg tgtgtgtctt tagcttaaca aaacaaaggy atggatcagg tagacaggag 1320 
gasgctggta ggaagtctgt gacacacttt gtgrtttgtg gtcatgcttg gcacttggtt 1380 
atggttttta ggaactcttt tactttgtgg ctttctctgt gtgttcaggt ttactttttg 1440 
catccatgaa tggtacatga agttcaccat caacccttca gtggatttct tttggtgaag 1500 
ccactgcccc ctggtgtgtg tgtgtgtgtt ttgtgtgttt gtgtgcatgc ctgttcatgt 1560 
gtccatgttt gtgtgttcag tgtgtttgtg tacatacttg atcatgtgtg catatgtgtg 1620 
tatgtataca agtgtgtgca tttgcatgtg tgtgtgtctg tgtaacattc agatgagctg 1680 
ccctgtgact ttcttattcc cttgataaaa catccttccc taaatctgca gttacagtgg 1740 
cagccagcca accccagtgg tcctcctgcc tcagctcccc aagcactggt gcaaatataa 1800 
ctgtagcttt ttacttgggt gctggtgatt caatcttagg ggctcctgcc tgtgcggcag 18 60 
gtgctcttat ctaccgaggc atctccccac tctccatctt atttttcaga agagaatttt 1920 
gaaaactcta caaatggcca agacagagcc ccaaggggcc atttagtgcc atgaaaggca 1980 
ttacctggag cctggaggca gtggctactt tgtgggagat accctggggt tgagcagggg 2040 
tggtggaaga gagtgccaga ggcagaagag cagcagacac cccaggccct gaggaaaaag 2100 
acctaggtgc tggagtcaag gttgtgttta ggggttttgt ccactgtagc aaacaagcca 2160 
gtcccaggct tggaggatgg aaccctgctc ctaatgtacc gctctctcta ttcccttcag 2220 
attgaaacct gtgtaactgt ccacttgtgg cagatgacca cactcttcca ggagcctagt 2280 
cccgactctc tcaagacttt caactttcta cccaggacat ggagattaga gtccaggaac 2340 
acataccggg gagcagatgc catgcactgg agattagtga atcattccca ggcaagtgtt 2400 
tgctacctca agataaaagc tgcagcaggc agctagcagg gcatggtggc gcacaccttt 2460 
aatcccagca ctcgggaggc agaggcaggc agatttctga gttcaaggcc agcctggtct 2520 
acagagtgag tgccaggaca gtcaggacta cacagagaga ccctctctca aaaaaccgaa 2580 
agaaaaaaaa aagccgcagc aggtgtccca tgcttttcat tttctcccac cccagggaag 2640 
gaggttcttc tccaccccct tgacccaatg tggccttttg tccattttta gtggacagtt 2700 
ggggtgtgtc atatgagaaa acttttttgg gggagatgag acaaactcaa gtacccagca 2760 
cagattggga acccacaaca gaccaaagta tgcacagaac agaagcccaa cgtagtgacc 2820 
agtgaacttt actggattat ataccggaac actggtgaga ggtcactcac aggagcagaa 2880 
atgagtcagt tggctgcatc accaaagccc accacagcct tggtgacaga caattcacaa 2940 
aaagctggtg ccttgacgca cattgcccag cctacagtca gctaacagat tggagagtgt 3000 
cctttccagg tgcctcctgt ggtctaaagc tcttccaggc agcttggctg gtcagcttct 3060 
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tctgggttct tttgtctgag tctttgaggc ttgactctgc tgcttctgac agggaggggc 3120 

agtgtgattc tgctcagttt cagggagttc ttaaagctac ttttgttgtt gttacttccc 3180 

tgactcagga agtgactgct atgaaagaaa gggagaaagg gaaggagaga gaaggtaaag 3240 

tgagaataca catggggaca cttctgccta ctggtgtagg tgttaaagta atgcttccct 3300 

ctggctacag aatccagaga ccaggtcccc ttttgctcct gtcccttcct gatgcctcag 3360 

gtctgaatgg gtcactaggg ctagctagac ttgccttaca ggatcctgat ggctgttccc 3420 

tttcctgcag ttttatggca cagaggaact ggtcctgatg ctggattcaa ggtaactgac 3480 

taatggaagc tgctgggaag gtttgtcagg gtcagggacc acggccctca ctggtgaggt 3540 

cacacaggta gccgctgact ttccctcctt aaaggaagac aatgctggca tctctccctg 3600 

agctttcctg gtgtaacaca cacaggaagt acaggacctc actgggatga cctaaagtca 3660 

cctctcagcc ctgggtaaag aaaaaaaaaa tctgtatgtt cagtgttcca catcatagct 3720 

aagccaaata gcttagttga gtgttggaat tgtgagtcct acgatcacaa tagaatccag 3780 

tgactctggc tgtcaagttt aatcatacag tgctgtgtga tcctggccac agggtaccta 3840 

cactgggcat ctccccagct tggtagcagc tctccctggc tgtgtagagc ttggaccatg 3900 

ggcagggctg gctggagccc aagccttgga tctaggaccc agttctgcag cgatccacta 3960 

atcagtgtct catttgcctt ctacaggtct tgatagcctg gctggctcct gcccctgtgc 4020 

ccttggggca gactttgcac ccacctctct gcttgttcct caagtttcta gttctacaac 4080 

acctcctgcc atggtctttt tacatgatct tctttctgtg agcctggcct atctgtggct 4140 

tgacggttct ccctagctgg aggcaaactg catcttctct tgctcacttt gtgcctttga 4200 

cactgacctt atagatggct cttaagaccc ctcttctctt caaaatccta cacctgggct 4260 

tctagtgcct tgtgtggcag gagatttgtc cccatttacc ttaggagttc tagaggtgcc 4320 

cctaaaggag gtacagatag agactctatg ccgtgtgggc ctctgagcca tcactggcaa 4380 

gaagttcgcc tggtgctggg tagaggacac tcagtgctca gtagcttgaa ggcacagtgg 4440 

gagacttaca gagaacatct ggaggacttg tccaggtgct cagttccttg gctccaggaa 4500 

gtgatgctga ccctagaccc ctcccaaccc caacttgttt ttaactaatt aaactttact 4560 

gtttttactt 4570 



<210> 56 
<211> 4786 
<212> DNA 
<213> mouse 



<400> 56 

gtcattcccc tgtctgaccg catttaagaa ggcaggcaac cagacagcct cattactatg 60 

gctgctgtgt ctgttgatcc ccagaggcca ctcccagtcc tcctggtctc tgttagcctg 120 

ggcatctatg aggatgaaca tcatagagtg tggatagctg taaacgtgga aacttctcat 180 

tcatcacatg gcaacagggt gagtttcagg gctccagaca ggttattccc caactcttat 240 

acctagtggg ctagggtgtg tgtgtgtgtg tgtgtgtgtg tgtgtaggta ggtgttgtta 300 

ttaccatttc agaaaatagc tgacacaggt ctctgaataa accttttcta tacctaggcc 360 

cataccctgg agctctcttg gaggcctcac aggtccaact aatacacagg tgcagagccc 420 

tagtagtgag gactctgtca ccaacagtca gagaaaacat gctgagaaag ctgggctttg 480 

tgggatgagt aggagttcta caggtcagaa aaaaaaaaaa aaaaaaaaag caactctggt 540 

gccaggcaga ttccctggaa ggagcaaatg tgagtgccag cactactttc cccatttctg 600 

caaatcataa ttcatctgct gctgttgctc ccagacttga gtggcctgaa gaagtgtttg 660 

acagcgaagg gatccttaca ctaggagctg actacgggac ttgggggaag tcctaatggg 720 

ttctgtgatc tgcaggtgat gtagagaccc acactgctgg aaacctgctc tggaccatgg 780 

ctttagtcca cagcaggtcc ttgggggcta aaacactctc taatgaaggc tctgggtatc 840 
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aaaactgctc cccagtgtcc tggaagagac tatgggacac ctcccctgtc tgggggtgga 900 

gaggaaatag gggcgttagg tctctttgaa tgctaggttt ttcctgcagc cccatcaccc 960 

tatcctaacc ttttctagcc cagcagccct tccctgaagc tcccctgact ccctgaatac 1020 

ctcagccatt ttggcaatgg gactctcttt gtcccttatc ctctccctcc tcgcaggaag 1080 

gaaggaagga aggtagaaag gaaaaaagga agggagggag ggagggagga aggaagggaa 1140 

ggaagggaag gaaggaagga aaaaagaaag aaaagaagaa agaaacagaa aagaattttg 1200 

ttagggtata aatgaaagag ctgcaaagga aataatgtgt agtcaattaa attcaaacaa 1260 

aggcacatgt ctcccagcca ttctgcttct gggctgggct tgtgttgaag cagccattgg 1320 

tcccacgtgt gcatagaggg gcaggtgcag caaccaaatg agctccttgt acaactaagt 1380 

gtaaaggaga gggggcaggg aagggtcgga tggcatttgt aaaaggagga acctgaggat 1440 

ccaggcagag ctgtctctaa gggatacatc caccagaggt gccctgtgga gtgccttcac 1500 

agagacagaa agcacaatgg tgggagggag ggggagttcc aagaccctat ggttgccata 1560 

ggtttagagc aggtgagttt acgaaaagac tctgggatgg gatgggggcc atctcaggac 1620 

cccccttacc acccccgtgc accccgaaat caccacagtg gccaggaaaa ggttccatag 1680 

gaatagaaag attggagaaa tttaggcaca ggtggccctg aattatttcc ctaattaaga 1740 

caatttcccc tctgtttgtc tgtctctcta tcaacctaca gattacagta acgaagactt 1800 

ttcaacatgc taccaaaatg ttccacaact gaggcacatt acagcctcct tttacttttt 18 60 

atgttgcaac agggtctcat gtgtgcatgt atgtgtgtct gtgtgtctct gtgtgtgttc 1920 

tgtatgtttg tgtacatacc tgttcatatg tgcatatgtg tgtgtgtgta tacaagtgtg 1980 

tgcatttgca tgtgtgtgtg tctgtgtaac attcaggtga gctgccctgt gactatctta 2040 

ttcccttgat aaaacatcct tccctaaatc tgcagttaca atggcagcca gccaatccca 2100 

gtggtcctcc tgcctcagct ccccaagcac tggtgcaaat ataactgtag ctttttactt 2160 

gggtgctggt gattcaaact taggggctcc tgcctgtgcg gcaggtgctc ttatctaccg 2220 

aggcatctcc ccactctctg tcttactttt cagaagagaa ttttgaaaac tctacaaatg 2280 

gccaagacag agccccaagg ggccatttag tgccatgaaa gacactgcta gcctggagac 2340 

agtggctact ttgtgggaca taccctgggg ttgagcaggg ttggtagaag agagtgccag 2400 

agaggcagaa gagcagcaga caccccaggc cctgaggaag aagacctagg tgctggagac 2460 

aaggctctgg ttagggatgt tgaccactgc agcaaacaag ctaggcccag gatggaggat 2520 

ggaaccctgc tcctaatgta ccgctctctc tgttcccttc agattgaaac atgtgtaact 2580 

gtgcacttgc agcatatgac cacactcccc caggagccta ctccccagca gcccattaat 2640 

aacaactctc tccccacgat gtggagatta gagtccagga acacatacac gggaacagat 2700 

gggacatact ggagattact ggatcattcc caggcaagtg ttgggtacct caagataaaa 2760 

gctgcagcag gtgtccagtg cttttcattc tgacccaccc ctgggaggga ggttctccct 2820 

ctcttccccc aagtctagcc cttggtccat ttttagtgga cgattggaac ctatgatatg 2880 

acaaaacttt tatggggaga tgagacaaac tgaagtaccc agcacagatt gggaacccac 2940 

aacagaccaa agtgtgcaca gaacagaagc ccaaattggt gaaccagtga actttactgg 3000 

gattacatac aggaacacca gtgaggggtc gctcgcagga gctgaaatga ctcagttgcg 3060 

tgcatcaaca aagaacttaa tggtattggt gacagacagt tcacaaaaac ggggaccttg 3120 

aagcacagtg tccaggctac agtcagctaa caagttggag agtgcccttt ccaagtgcct 3180 

ctgctggtct aaagctcttc caggcagctg gtcggattct tctgggtcct tttgtctgag 3240 

tctttgaggc ttgactctgc tgcttctgac agggagggac agtgtgattc tgctcagttt 3300 

cagggagttc ttgaagctac ttttgtagtt gttacttccc tgctttagga agtgactgct 3360 

atgaaagaaa aggtcagagg gaaggagaga ggaggcagag agggagaggg aggaggggag 3420 

agagaaggta gagagtgaat gcccatgggg acacttctgc ctactggtgt aggtgtgaaa 3480 

gtaatgcttc cctctggcta cagaatccag agaccaggtc cccttttgct acttgtccct 3540 

tcctgatgcc tcaggtctga atgggttact agggctggct agatttgcct tacaggaccc 3600 

tgatggctat tccctctcct acagatgggc gacacagtgc aactgaccct ggacataata 3660 

ataggcgagg atgactgaat gatgggagct gctgggaagg tttgtcaggg tcagggacca 3720 
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cgaccctcac tggtgaggtc acacaggtag 
gtgctggcat ctctccctga gctttccttg 
tgggatgacc taaagtcacc gctcagccct 
catcatagct aagtaaaaga gcactgtgtg 
cctatgatca cactagaatc cactgactgt 
tgatcctggc tccagaggac ctacactagg 
tggctgtgta gagattggac catgggcagg 
gacccagttc tgcagcgatc cactaatcag 
gcctggctgg ctcctgcccc tgtgccctcg 
ttcctcaagt ttctagttct accacacctc 
tctactttct gtgagccttg cctatctctg 
ctgcatcttc tcttgctcac tttgtgcctc 
agacccctcc tccattcaaa ctcctccacc 
tttgtcccca tttaccttag gatgtataca 
tctatgccct gtgggcctct gagccatcac 
ggacactacg tgctcagtag cttgaaggca 
tacttgtcca ggtgctcagt tccttggctc 
caaccccaac ttgtttttaa ctaattaaac 



ctgctgactt tccctcttta taggaagcca 3780 
tgtaacgcac acaggagcac aggacctcac 3840 
gggggaaaaa cctgaatgtt ctgtgttcca 3900 
ttggaattgt gggtcctatg atctcagagt 3960 
gtctctcaag tttaatcata atctgctgtg 4020 
catctccacc agccttgtag cagctctacc 4080 
gctggctgga gcccaagcct ttggatctag 4140 
tgtctcattt gccttctaca ggtcttgata 4200 
gggtagactt tgcacccaac cctctgcttg 4260 
ctgccatgac tagacctgct ttttagatta 4320 
tcctgaccat tctccctagc tggagacaaa 4380 
tgacactgac cttatagatg gcttttctta 4440 
tgggcttcta gtgccttgtg tggcaggaga 4500 
ggtgcctctg gaggaggtac agatagggac 4560 
tggcaagaag ttcgactggt gctgggtaga 4 620 
cagtgggaga cttacagaga atatctggag 4680 
caggaagtga atctgaccct aggcccctcc 4740 
tttactgttt ttactt 4786 



<210> 57 
<211> 116 
<212> PRT 
<213> mouse 

<400> 57 
Met Ala Ala Ala 
1 

Val Ser Val Arg 

20 

lie Val Ala Asn 
35 

Arg Thr His Val 
50 

lie Pro Phe Asn 
65 

Leu Glu Ser Arg 



Leu Leu Asp His 

100 



Ala Phe Asp Pro 
5 

Leu Gly lie Tyr 



Val Glu Thr Ser 

40 

Thr Val His Leu 
55 

Pro Leu Asn Tyr 
70 

Asn lie Tyr Trp 
85 

Ser Gin Leu Gly 



Leu Gly Pro Leu 
10 

Glu Asp Glu His 
25 

His Ser Ser His 



Trp Lys Leu lie 

60 

Asp Phe Leu Pro 
75 

Ala Thr Asp Gly 
90 

Asp Thr Glu Gin 
105 



Pro Val Tyr Leu 
15 

His Arg Val Trp 
30 

Gly Asn Arg Arg 
45 

Pro Gin Gin Val 



Thr Thr Trp Lys 

80 

Thr His Trp Arg 
95 

Leu lie Leu Met 
110 



Leu Val Leu Gly 



115 



<210> 58 
<211> 129 
<212> PRT 
<213> mouse 

<400> 58 

Met Ala Ala Ala Ala Phe Asp Pro Leu Gly Pro Leu Pro Val Tyr Leu 
15 10 15 

Val Ser Val Arg Leu Gly lie Tyr Glu Asp Glu His His Arg Val Trp 

20 25 30 

lie Val Ala Asn Val Glu Thr Ser His Ser Ser His Gly Asn Arg Arg 
35 40 45 

Arg Thr His Val Thr Val His Leu Trp Lys Leu lie Pro Gin Gin Val 
50 55 60 

lie Pro Phe Asn Pro Leu Asn Tyr Asp Phe Leu Pro Thr Thr Trp Lys 
65 70 75 80 

Leu Glu Ser Arg Asn He Tyr Trp Ala Thr Asp Gly Thr His Trp Arg 

85 90 95 

Leu Leu Asp His Ser Gin Val Leu He Ala Trp Leu Ala Pro Ala Pro 

100 105 110 

Val Pro Ser Gly Arg Leu Cys Thr His Pro Ser Val Cys Ser Ser Ser 
115 120 125 

Phe 



<210> 59 
<211> 117 
<212> PRT 
<213> mouse 

<400> 59 

Met Ala Ala Ala Gly Phe Tyr Pro Pro Arg Leu Leu Pro Gin Val Leu 
15 10 15 

He Ser Thr Gly Pro Gly Phe Tyr Glu Asp Glu His His Arg Leu Trp 

20 25 30 



37 



Met Val Ala Lys Leu Glu Thr Cys Ser His Ser Pro Tyr Cys Asn Lys 
35 40 45 



He Glu Thr Cys Val Thr Val His Leu Trp Gin Met Thr Arg Tyr Pro 
50 55 60 

Gin Glu Pro Ala Pro Tyr Asn Pro Met Asn Tyr Asn Phe Leu Pro Met 
65 70 75 80 

Thr Trp Arg Leu Ala Ser Met Asn Thr Tyr Arg Gly Thr Asp Ala Met 

85 90 95 

His Trp Arg Leu Leu Asn His Ser Gin Val Gly Asp Thr Val Gin Leu 

100 105 HO 

He Leu Met Leu Glu 
115 



<210> 60 
<211> 122 
<212> PRT 
<213> mouse 

<400> 60 

Met Ala Asp Ser Val His Phe Pro Trp Met Pro Phe Pro Pro Arg Phe 
15 10 15 

Leu Val Cys Thr Arg Asp Asp He Tyr Glu Asp Glu Asn Gly Arg Gin 

20 25 30 

Trp Val Val Ala Lys Val Glu Thr Ser Arg Ser Pro Tyr Gly Ser Arg 
35 40 45 

He Glu Thr Cys He Thr Val His Leu Gin His Met Thr Thr He Pro 
50 55 60 

Gin Glu Pro Thr Pro Gin Gin Pro He Asn Asn Asn Ser Leu Pro Thr 
65 70 75 80 

Met Trp Arg Leu Glu Ser Met Asn Thr Tyr Thr Gly Thr Asp Gly Thr 

85 90 95 

Tyr Trp Arg Leu Leu Asp His Ser Gin Met Gly Asp Thr Leu Gin Leu 

100 105 HO 

He Leu Asp He Val He Cys Glu Val Asp 
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115 



120 



<210> 61 
<211> 107 
<212> PRT 
<213> mouse 

<400> 61 
Met Arg Leu Ser 
1 

Ser Gly His Arg 

20 

Leu lie Glu Thr 
35 

Pro Gin Glu Pro 
50 

Thr Met Trp Arg 
65 

Thr Tyr Trp Arg 



Leu lie Leu Asp 

100 



Gly His Arg Gly 
5 

Ala Leu Gin Arg 



Cys lie Thr Val 

40 

Thr Pro Gin Gin 
55 

Leu Glu Ser Met 
70 

Leu Leu Asp His 
85 

lie Val He Cys 



Leu Gin Trp Ala 
10 

Ala Ser Leu Lys 
25 

His Leu Gin His 



Pro He Asn Asn 

60 

Asn Thr Tyr Thr 
75 

Ser Gin Met Gly 
90 

Glu Val Asp 
105 



Ser Leu Arg Phe 
15 

Leu Ser Gly His 
30 

Met Thr Thr He 
45 

Asn Ser Leu Pro 



Gly Thr Asp Gly 

80 

Asp Thr Leu Gin 
95 



<210> 62 
<211> 112 
<212> PRT 
<213> mouse 

<400> 62 

Met Pro Phe Pro Pro Cys Phe Leu Val Cys Thr Arg Asp Asp He Tyr 
15 10 15 

Glu Asp Glu His Gly Arg Gin Trp Val Ala Ala Lys Val Glu Thr Ser 

20 25 30 

Ser His Ser Pro Tyr Cys Ser Lys lie Glu Thr Cys Val Thr Val His 
35 40 45 

Leu Trp Gin Met Thr Thr Leu Phe Gin Glu Pro Ser Pro Asp Ser Leu 
50 55 60 

39 



Lys Thr Phe Asn Phe Leu Pro Arg Thr Trp Arg Leu Glu Ser Arg Asn 
65 70 75 80 



Thr Tyr Arg Gly Ala Asp Ala Met His Trp Arg Leu Val Asn His Ser 

85 90 95 

Gin Phe Tyr Gly Thr Glu Glu Leu Val Leu Met Leu Asp Ser Arg Ser 

100 105 110 



<210> 63 
<211> 121 
<212> PRT 
<213> mouse 

<400> 63 
Met Ala Ala Val 
1 

Val Ser Val Ser 

20 

lie Ala Val Asn 
35 

Glu Thr Cys Val 
50 

Glu Pro Thr Pro 
65 

Trp Arg Leu Glu 



Trp Arg Leu Leu 

100 

Leu Asp lie lie 
115 



Ser Val Asp Pro 
5 

Leu Gly lie Tyr 



Val Glu Thr Ser 

40 

Thr Val His Leu 
55 

Gin Gin Pro lie 
70 

Ser Arg Asn Thr 
85 

Asp His Ser Gin 



lie Gly Glu Asp 

120 



Gin Arg Pro Leu 
10 

Glu Asp Glu His 
25 

His Ser Ser His 



Gin His Met Thr 

60 

Asn Asn Asn Ser 
75 

Tyr Thr Gly Thr 
90 

Met Gly Asp Thr 
105 

Asp 



Pro Val Leu Leu 
15 

His Arg Val Trp 
30 

Gly Asn Arg lie 
45 

Thr Leu Pro Gin 



Leu Pro Thr Met 

80 

Asp Gly Thr Tyr 
95 

Val Gin Leu Thr 
110 



40 



